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Abstract 

The paper studies the convergence, as N tends to infinity, of a system of N coupled 
Hamilton-Jacobi equations, the Nash system. This system arises in differential game theory. 
We describe the limit problem in terms of the so-called “master equation”, a kind of second 
order partial differential equation stated on the space of probability measures. Our first main 
result is the well-posedness of the master equation. To do so, we first show the existence and 
uniqueness of a solution to the “mean field game system with common noise”, which consists 
in a coupled system made of a backward stochastic Hamilton-Jacobi equation and a forward 
stochastic Kolmogorov equation and which plays the role of characteristics for the master 
equation. Our second main result is the convergence, in average, of the solution of the Nash 
system and a propagation of chaos property for the associated “optimal trajectories”. 
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Statement of the problem. The purpose of this paper is to discuss the behavior, as N tends 
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to infinity, of the strongly coupled system of N parabolic equations 
r N N 

a;) - 2 x) - (3 Yj x) + H{xi, x)) 

j=i j,k=i 

< +YDpH{xj,D^.v^’^{t,x)) ■ D^.v^’\t,x) = F^’\x) (1) 

jj^i 

in [0,r] X 

^ v^’\T,x) = G^’\x) in 

The above system is stated in [0,T] x where a typical element is denoted by {t,x) with 

X = {xi,... ,xn) e The unknowns are the N maps (r’^’*)iE{i,...,Ar}. The data are the 

Hamiltonian H : x ^ M., the maps ^ M, the nonnegative parameter (3 

and the horizon T ^ 0. 

System ([T|) describes the Nash equilibria of an iV—player differential game (see Section 11.21 
for a short description). In this game, the set of “optimal trajectories” solves a system of N 
coupled stochastic differential equations (SDE): 

dX,^t =-DpH{X,^t,Dv^’\t,Xt))dt + V2dBl + ^/^dWt, te[0,T], ie{l,...,N}, (2) 

where is the solution to ([T]) and the ((Hj)jg|-o^'r])i=i,...,Ar and d—dimensional 

independent Brownian motions. The Brownian motions ((H^)te[o,T])i=i,...,iV correspond to the 
individual noises, while the Brownian motion {Wt)te[o,T] is the same for all the equations and, 
for this reason, is called the common noise. Under such a probabilistic point of view, the collec¬ 
tion of random process ((^i,i)tg[o,r])i=i,...,Af forms a dynamical system of interacting particles. 
Another, but closely related, objective of our paper is to study the mean-field limit of the 
((^*,t)ts[o,T])i=i,...,7V as N tends to infinity. 

As explained below, the motivation for investigating ([T|) and (l2|) asymptotically is to justify 
the passage to the limit in the theory of mean-held games. 

Link with the mean-field theory. Of course, there is no chance to observe a mean-held limit 
for ([2]) under a general choice of the coefficients in ([T|). Asking for a mean-held limit certainly 
requires that the system has a specihc symmetric structure in such a way that the players in 
the differential game are somewhat exchangeable (when in equilibrium). For this purpose, we 
suppose that, for each z e {1,..., N}, the maps (E'^)'^ 3 x ^ F^’^{x) and (E'^)^ 3 x ^ G^’^{x) 
depend only on Xi and on the empirical distribution of the variables (xj)j^j: 

F^’\x) = F{xi,m^’^) and G^’*(£c) = G(a:j, m^’*), (3) 

where empirical distribution of the [xfij^i and where F,G:W^x 

P(E'^) ^ E are given functions, P(E'^) being the set of Borel measures on E'^. Under this 
assumption, the solution of the Nash system indeed enjoys strong symmetry properties, which 
imply in particular the required exchangeability property. Namely, can be written into a 
similar form to dS]): 

v^’'^{t,x) = {t,Xi,m^’^), te[0, T], x e , (4) 

for a function v^{t, •, •) taking as arguments a state in E'^ and an empirical distribution of size 
A^ — 1 over E'^. 

Anyhow, even under the above symmetry assumptions, it is by no means clear whether the 
system m can exhibit a mean-field limit. The reason is that the dynamics of the particles 
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(Xi, ts[o,T] coupled through the unknown solutions ,... to the Nash 

system m, whose symmetry properties ([H) may not suffice to apply standard results from the 
theory of propagation of chaos. Obviously, the difficulty is that the function in the right-hand 
side of ([H precisely depends upon N. Part of the challenge in the paper is thus to show that 
the interaction terms in ([2]) get closer and closer, as N tends to the infinity, to some interaction 
terms with a much more tractable and much more explicit shape. 

In order to get a picture of the ideal case under which the mean-field limit can be taken, 
one can choose for a while /? = 0 in ([2]) and then assume that the function in the right-hand 
side of (jH) is independent of N. Equivalently, one can replace in ([2]) the interaction function 
(]^(i)Af 3 2 ; DpH{xi,v^'^{t,x)) by 3 x ^ b{xi,mx’^), for a map 6 ; E"^ x P(E‘^) >—>■ E'^. 

In such a case, the coupled system of SDEs (l2|) turns into: 

dXi,t = b[Xi,t,j^J]Sxj,,)dt + V2dBi, te[0,T], ie{l,...,N}, (5) 

the second argument in b being nothing but the empirical measure of the particle system at 
time t. Under suitable assumptions on b (e.g., if b is bounded and Lipschitz continuous in both 
variables, the space of probability measures being equipped with the Wasserstein distance) and 
on the initial distribution of the both the marginal law of (X^^)jg[o,T] (or 

of any other player) and the empirical distribution of the whole system converge to the solution 
of the McKean-Vlasov equation 

dtm — Am + div(m 6(-, m)) = 0. 

(see, among many other references, McKean [53], Sznitman |64| . Meleard |55]....l. The standard 
strategy for establishing the convergence consists in a coupling argument. Precisely, if one 
introduces the system of N independent equations 

dYi^t = b{Yi,t,C{Yi,t)) dt + V2dBi, t e [0,r], i e {I,..., iV}, 

(where CiYi^t) is the law of V,t) with the same (chaotic) initial condition as that of the pro¬ 
cesses ((Xj^t)ig[o,T])i=i,...,Af, then it is known that (under appropriate integrability conditions, 
see Fournier and Guillin [26]) 

sup E [\Xi^t - Yi^t\] ^ (l{d=^ 2 } + ln(l + N)l{d^ 2 }) ■ 

te[0,T] 

In comparison with ([5]), all the equations in ([2]) are subject to the common noise {Wt)te[o,T]-, 
at least when /3 =|= 0. This makes a first difference between our limit problem and the above 
McKean-Vlasov example of interacting diffusions, but, for the time being, it is not clear how 
deep this may affect the analysis. Indeed, the presence of a common noise does not constitute a 
real challenge in the study of McKean-Vlasov equations, the above coupling argument working 
in that case as well, provided that the distribution of Y is replaced by its conditional distribution 
given the realization of the common noise. However, the key point here is precisely that our 
problem is not formulated as a McKean-Vlasov equation, since the drifts in ([2]) are not of the 
same explicit mean-field structure as they are in ([5]) because of the additional dependence upon 
N in the right-hand side of ([H -obviously this is the second main difference between ([2]) and 
([5]) -. This makes rather difficult any attempt to guess the precise impact of the common noise 
onto the analysis. For sure, as we already pointed out, the major issue for analyzing ([2]) is 
caused by the complex nature of the underlying interactions. As the equations depend upon 
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one another through the nonlinear system the evolution with N of the coupling between 
all of them is indeed much more intricate than in ([5]). And once again, on the top of that, the 
common noise adds another layer of difficulty. For these reasons, the convergence of both ([T]) 
and ([2]) has been an open question since Lasry and Lions’ initial papers on mean field games 

[miii]. 

The mean field game system. The analysis of the Nash system m as the number of players 
is large pops up very naturally in game theory. Similar questions for static games were studied 
a long time ago by Aumann, who introduced the concept of nonatomic games in [7]; moreover, 
Schmeidler [63] and Mas-Colell |5l| defined and investigated non-cooperative Nash equilibria for 
one shot games with infinitely many small players. 

In the case of differential games, the theory is known under the name of “mean-field games”, 
whose principle goes as follows. If one tries, at least in the simpler case /3 = 0, to describe -in 
a heuristic way- the structure of a game with infinitely many indistinguishable players, i.e., a 
“nonatomic differential game”, one finds a problem in which each (infinitesimal) player optimizes 
his payoff, depending upon the collective behavior of the others, and, meanwhile, the resulting 
optimal state of each of them is exactly distributed according to the state of the population. 
This is the “mean field game system” (MFG system): 

f —dtu — Au + H{x,Du)=F{x,m{t)) in[0,T]x]R'^, 

< dtm — Am — div{mDpH{x, Du)) = 0 in [0, T] x M'’*, (6) 

[ u{T,x) = G{x,m(T)), m(0, •) = m(o) in M'’*, 

where m(o) denotes the initial state of the population. The system consists in a coupling between 
a (backward) Hamilton-Jacobi equation, describing the dynamics of the value function of any of 
the players, and a (forward) Kolmogorov equation, describing the dynamics of the distribution 
of the population. In that framework, H reads as an Hamiltonian, F is understood as a running 
cost and G as a terminal cost. Since its simultaneous introduction by Lasry and Lions [50| and by 
Huang, Caines and Malhame [32], this system has been thoroughly investigated: existence, under 
various assumptions, can be found in [H]HT]|33llMlES]HOllSni[52] . Concerning uniqueness of the 
solution, two regimes were identified in [50]. Uniqueness holds under Lipschitz type conditions 
when the time horizon T is short (or, equivalently, when F and G are “small”), but, as for 
finite-dimensional two-point boundary value problems, it may fail when the system is set over a 
time interval of arbitrary length. Over long time intervals, uniqueness is guaranteed under the 
quite fascinating condition that F and G are monotonous, i.e., if, for any measures m,m', the 
following holds: 

/ {F{x,m) — F{x,m')d{m — m'){x) ^ d and / {G{x,m) — G{x,m')d{m — m'){x) ^ 0. (7) 
jRd 

The interpretation of the monotonicity condition is that the players dislike congested areas and 
favor configurations in which they are more scattered, see Remark 12.61 below for an example. 
Generally speaking, condition ([7]) plays a key role throughout the paper, as it guarantees not 
only uniqueness but also stability of the solutions to ([6]). 

As announced, a solution to the mean held game system ([6]) can be indeed interpreted as 
a Nash equilibrium for a differential game with inhnitely many players: in that framework, it 
plays the role of the Schmeidler’s non-cooperative equilibrium. A standard strategy to make 
the connection between ([6]) and differential games consists in inserting the optimal strategies 
from the Hamilton-Jacobi equation in ([6]) into hnitely many player games in order to construct 
approximate Nash equilibria: see [33], as well as [miMiisiiiio]. However, although it establishes 
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the interpretation of the system ([6]) as a differential game with infinitely many players, this says 
nothing about the convergence of ([I]) and (p|). 

When (5 is positive, the system describing Nash equilibria within a population of infinitely 
many players subject to the same common noise of intensity {3 cannot be longer described by 
a deterministic system of the same form as (l6|). Owing to the theory of propagation of chaos 
for systems of interacting particles, see the short remark above, the unknown m in the forward 
equation is then expected to represent the conditional law of the optimal state of any player 
given the realization of the common noise. In particular, it must be random. This turns the 
forward Kolmogorov equation into a forward stochastic Kolmogorov equation. As the Hamilton- 
Jacobi equation depends on m, it renders u random as well. Anyhow, a key fact from the theory 
of stochastic processes is that the solution to a stochastic differential equation must be adapted 
to the underlying observation, as its values at some time t cannot anticipate the future of the 
noise after t. At first sight, it seems to be very demanding as u is also required to match, at 
time T, G{-,m(T)), which depends on the whole realization of the noise up until T. The right 
formulation to accommodate both constraints is given by the theory of backward stochastic 
differential equations, which suggests to penalize the backward dynamics by a martingale in 
order to guarantee that the solution is indeed adapted. We refer the reader to the monograph 
|59] for a complete account on the finite dimensional theory and to the paper [60] for an insight 
into the infinite dimensional case. Denoting by W “the common noise” (here, a d—dimensional 
Brownian motion) and by m(o) the initial distribution of the players at time to, the MFG system 
with common noise then takes the form (in which the unknown are now {ut,mt,Vt)). 

dm = {-(1 + /3)Aut + H{x,Dut) - F{x,mt) - ^/^diY{vt)}dt + vt ■ \/^dWt 

in [0, T] X T'^, 

^ dtmt = [(1 + P)Amt + diy{mtDpH{mt, Dut))]dt - diY{mt\^dWt), (8) 

in [0, T] X T"*, 

ut{x) = G{x,mT), 'mo = ^-(0)) in 

where we used the standard convention from the theory of stochastic processes that consists 
in indicating the time parameter as an index in random functions. As suggested right above, 
the map vt is a random vector field that forces the solution ut of the backward equation to be 
adapted to the filtration generated by {Wt)te[o,T]- As far as we know, the system ([8|) has never 
been investigated and part of the paper will be dedicated to its analysis (see however [T9| for an 
informal discussion). Below, we call the system ([8|) the MFG system with common noise. 

It is worth mentioning that the aggregate equations Q and dH) (see also the master equation 
([U|) below) are the continuous time analogues of equations that appear in the analysis of dynamic 
stochastic general equilibria in heterogeneous agent models, as introduced in economic theory by 
Aiyagari [3] , Bewley [T3| and Huggett m- In this setting, the factor [3 describes the intensity of 
“aggregate shocks”, as discussed by Krusell and Smith in the seminal paper |42j . In some sense, 
the limit problem studied in the paper is an attempt to deduce the macroeconomic models, 
describing the dynamics of a typical (but heterogeneous) agent in an equilibrium configuration, 
from the microeconomic ones (the Nash equilibria). 

The master equation. Although the mean field game system has been widely studied since its 
introduction in |50| and |32|, it has become increasingly clear that this system was not sufficient 
to take into account the entire complexity of dynamic games with infinitely many players. The 
need for reformulating the original system Q into the much more complex stochastic version 
([8|) in order to accommodate with the common noise (i.e., the case f3 > 0) sounds as a hint in 
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that direction. In the same spirit, we may notice that the original MFG system ([6]) does not 
accommodate with mean field games with a major player and infinitely many small players, see 
|31j . And, last but not the least, the main limitation is that, so far, the formulation based on 
the system Q (or ([8]) when /3 > 0) has not permitted to establish a clear connection with the 
Nash system ©• 

These issues led Lasry and Lions [52] to introduce an infinite dimensional equation -the so- 
called “master equation”- that directly describes, at least formally, the limit of the Nash system 
p!|) and encompasses the above complex situations. Before writing down this equation, let us 
explain its main features. One of the key observations has to do with the symmetry properties, 
to which we already alluded, that are satisfied by the solution of the Nash system m- Under the 
standing symmetry assumptions ([3|) on the and (Hj) says that the 

can be written into a similar form to ([3|), namely v^’^{t,x) = {t,Xi,mx’^) (where 

the empirical measures are defined as in (|3|)), but with the obvious but major restriction 
that the function that appears on the right-hand side of the equality now depends upon 
N. With such a formulation, the value function to player i reads as a function of the private 
state of player i and of the empirical distribution formed by the others. Then, one may guess, 
at least under the additional assumption that such a structure is preserved as N —> -l-oo, that 
the unknown in the limit problem takes the form U = U{t,x,m), where x is the position of the 
(typical) small player at time t and m is the distribution of the (infinitely many) other agents. 

The question is then to write down the dynamics of U. Plugging U = U{t, Xi, nix''') into the 
Nash system ([T]), one obtains—at least formally—an equation stated in the space of measures 
(see Subsection o for a heuristic discussion). This is the so-called master equation. It takes 
the form: 

^ -dtU-{l + /3)AxU + H{x,DxU) 

-{1 + 13) I divy[DmU] dm{y) + [ DmU ■ DpH{y, D^U) dm{y) 

—2(3 I diYx[DmU]dm{y) — P I dm ^ dm = F{x,m) 

in [0,r] X X P(M'^) 

U{T,x,m) = G{x,m) inM'^xP(M‘^) 

In the above equation, dtU, D^U and A^U stand for the usual time derivative, space derivatives 
and Laplacian with respect to the local variables {t, x) of the unknown U, while DmU and 
D'^^U are the first and second order derivatives with respect to the measure m. The precise 
definition of these derivatives is postponed to Section [2j For the time being, let us just note that 
it is related with the derivatives in the space of probability measures described, for instance, by 
Ambrosio, Gigli and Savare in |3] and by Lions in [52]. It is worth mentioning that the master 
equation (j9|) is not the first example of an equation studied in the space of measures -by far: 
for instance Otto [58j gave an interpretation of the porous medium equation as an evolution 
equation in the space of measures, and Jordan, Kinderlehrer and Otto [38] showed that the heat 
equation was also a gradient flow in that framework; notice also that the analysis of Hamilton- 
Jacobi equations in metric spaces is partly motivated by the specific case when the underlying 
metric space is the space of measures (see in particular um and the references therein)-. The 
master equation is however the first one to combine at the same time the issue of being nonlocal, 
nonlinear and of second order. 

Beside the discussion in [52], the importance of the master equation ([9]) has been acknowl¬ 
edged by several contributions: see for instance the monograph m and the companion papers 
m and [T^ in which Bensoussan, Frehse and Yam generalize this equation to mean field type 
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control problems and reformulate it as a PDE set on an space, m where Carmona and 
Delarue interpret this equation as a decoupling field of forward-backward stochastic differential 
equation in infinite dimension. 

If the master equation has been discussed and manipulated thoroughly in the above refer¬ 
ences, it is mostly at a formal level: The well-posedness of the master equation has remained, 
to a large extend, open until now. Beside, even if the master equation has been introduced to 
explain the convergence of the Nash system, the rigorous justification of the convergence has 
not been understood. 

The aim of the paper is to give an answer to both questions. 

Well-posedness of the master equation. The largest part of this paper is devoted to the 
proof of the existence and uniqueness of a classical solution to the master equation ([9]), where, by 
classical, we mean that all the derivatives in Q exist and are continuous. In order to avoid issues 
related to boundary conditions or conditions at infinity, we work for simplicity with periodic 
data: the maps H, F and G are periodic in the space variable. The state space is therefore the d- 
dimensional torus T'^ = and m(o) belongs to 'P(T‘^), the set of Borel probability measures 

on T^. We also assume that T, G : T'^ x P(T'^) ^ M satisfy the monotonicity conditions ([7|), are 
sufficiently “differentiable” with respect to both variables and, of course, periodic with respect 
to the state variable. Although the periodicity condition is rather restrictive, the extension to 
maps defined on the full space or to Neumann boundary conditions is probably not a major 
issue. Anyhow, it would certainly require further technicalities, which would have made the 
paper even longer than it is if we had decided to include them. 

So far, the existence of classical solutions to the master equation has been known in more re¬ 
stricted frameworks. Lions discussed in [52] a finite dimensional analogue of the master equation 
and derived conditions for this hyperbolic system to be well-posed. These conditions correspond 
precisely to the monotonicity property ([7|), which we here assume to be satisfied by the coupling 
functions F and G. This parallel strongly indicates -but this should not does not come as a 
surprise- that the monotonicity of F and G should play a key role in the unique strong solv¬ 
ability of (jH). Lions also explained in [52| how to get the well-posedness of the master equation 
without noise (no Laplacian in the equation) by extending the equation to a (fixed) space of 
random variables under a convexity assumption in space of the data. In m Buckdahn, Li, 
Peng and Rainer studied equation ([9|), by means of probabilistic arguments, when there is no 
coupling nor common noise {F = G = 0, (3 = 0) and proved the existence of a classical solution 
in this setting; in a somewhat similar spirit, Kolokoltsov, Li and Yang [iQ] and Kolokoltsov, 
Troeva and Yang [H] investigated the tangent process to a flow of probability measures solving 
a McKean-Vlasov equation. Gangbo and Swiech [28] analyzed the first order master equation in 
short time (no Laplacian in the equation) for a particular class of Hamiltonians and of coupling 
functions F and G (which are required to derive from a potential in the measure argument). 
Chassagneux, Crisan and Delarue m obtained, by a probabilistic approach similar to that used 
in m, the existence and uniqueness of a solution to Q without common noise (when /3 = 0) 
under the monotonicity condition ([^ in either the non degenerate case (as we do here) or in the 
degenerate setting provided that F, H and G satisfy an additional convexity conditions in the 
variables {x,p). The complete novelty of our result, regarding the specific question of solvability 
of the master equation, is the existence and uniqueness of a classical solution to the problem 
with common noise. 

The technique of proof in [16( [22l 128] consists in finding a suitable representation of the 
solution: indeed a key remark in Lions [52] is that the master equation is a kind of transport 
equation in the space of measures and that its characteristics are, when /3 = 0, the MFG system 
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l|6|l. Using this idea, the main difficulty is then to prove that the candidate is smooth enough to 
perform the computation showing that it is a classical solution of Q. In [161122] this is obtained 
by linearizing systems of forward-backward stochastic differential equations, while [28] relies on 
a careful analysis of the characteristics of the associated first order PDE. 

Our starting point is the same: we use a representation formula for the master equation. 
When /3 = 0, the characteristics are just the solution to the MFG system Q. When fS is positive, 
these characteristics become random under the action of the common noise and are then given 
by the solution of the MFG system with common noise ([8|). 

The construction of a solution U to the master equation then relies on the method of char¬ 
acteristics. Namely, we define U by letting U{to, x,mo) := utfix) where the pair {ut.,mt)te[to,T] 
is the solution to (|8|) when the forward equation is initialized at m(o) e at time to, that is 

dm = {-(1 -f fi)Aut + H{x, Dut) - F{x, mt) - ■\/^dYv{vt)]dt + vt ■ -y/^dWt 

in [to,T] X T"*, 

^ dtmt = [(1 -f fi)Amt + div{mtDpH{nit, Dut))]dt - div(mt^/^dWt), (10) 

in [to, T] X 

ut{x) = G{x,mT), mtQ = m(o) in T'^, 

There are two main difficult steps in the analysis. The first one is to establish the smoothness 
of U and the second one is to show that U indeed satisfies the master equation Q. In order to 
proceed, the cornerstone is to make a systematic use of the monotonicity properties of the maps 
F and G: Basically, monotonicity prevents the emergence of singularities in finite time. Our 
approach seems to be very powerful, although the reader might have a different feeling due to 
the length of the paper. As a matter of fact, part of the technicalities in the proof are caused by 
the stochastic aspect of the characteristics (fTOjl . As a result, we spend much effort to handle the 
case with a common noise (for which almost nothing has been known so far), but, in the simpler 
case /3 = 0, our strategy to handle the first order master equation provides a much shorter proof 
than in the earlier works [I6l|22l[28|. For this reason, we decided to display the proof in this 
simple context separately (Section [3|). 

The convergence result. Although most of the paper is devoted to the construction of a 
solution to the master equation, our main (and primary) motivation remains to justify the 
mean field limit. Namely, we show that the solution of the Nash system ([T|) converges to the 
solution of the master equation. The main issue here is the complete lack of estimates on 
the solutions to this large system of Hamilton-Jacobi equations: This prevents the use of any 
compactness method to prove the convergence. So far, this question has been almost completely 
open. The convergence has been known in very few specific situations. For instance, it was 
proved for the ergodic mean field games (see Lasry-Lions m, revisited by Bardi-Feleqi [9|). In 
this case, the Nash equilibrium system reduces to a coupled system of N equations in T'^ (instead 
of N equations in as ([T|)) and estimates of the solutions are available. Convergence is also 
known in the “linear-quadratic” setting, where the Nash system has explicit solutions: see Bardi 
[8]. Let us finally quote the nice results by Fischer [25] and Lacker [45] on the convergence of 
open loop Nash equilibria for the N—player game and the characterization of the possible limits. 
Therein, the authors overcome the lack of strong estimates on the solutions to the N—player 
game by using the notion of relaxed controls for which weak compactness criteria are available. 
The problem addressed here—concerning closed loop Nash equilibria —differs in a substantial 
way from [25] 145] : Indeed, we underline the surprising fact that the Nash system ([T]), which 
concerns equilibria in which the players observe each other, converges to an equation in which 
the players only need to observe the evolution of the distribution of the population. 



Our main contribution is a general convergence result, in large time, for mean field games 
with common noise, as well as an estimate of the rate of convergence. The convergence holds in 
the following sense: for any x e let := ^xi- Then 
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We also prove a mean field result for the optimal solutions (l2|): if the initial conditions of the 
,_jv are i.i.d. and with the same law m(o) £ P(T'^), then 
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where the ((li,t)j=i,...,7v)iE[o,T] are the solutions to the McKean-Vlasov SDE 

dYi^t = -DpH{Yi^uD^U{t,Y^,uC{Y^,t\W)))dt + V2dBl + ^^dWt, t e [to,T], 

with the same initial condition as the ((Vj^i)j=i^...^jv)ts[o,T]- Here U is the solution of the master 
equation and C(Yi^t\W) is the conditional law of given the realization of the whole path W. 
Since the ((li,t)fE[o,T])j=i,...,Af are conditionally independent given W, the above result shows 
that (conditional) propagation of chaos holds for the V—Nash equilibria. 

The technique of proof consists in testing the solution U of the master equation ([^ as a 
nearly solution to the V—Nash system ©• On the model of ([3]), a natural candidate for being 
an approximate solution to the V—Nash system is indeed 

u^’'^{t,x) = te[0,T], a; e 

Taking benefit from the smoothness of U, we then prove that the “proxies” almost 

solve the V—Nash system m up to a remainder term that vanishes as N tends to oo. As 
a by-product, we deduce that the get closer and closer to the “true solutions” 

when N tends to oo, which yields (jllh . As the reader may notice, the convergence 
property (11111 is stated in a symmetric form, namely the convergence holds in the mean, the 
average being taken over all the particles. Of course, this is reminiscent of the symmetry 
properties satisfied by the V—Nash system, which play a crucial role in the proof. 

It is worth mentioning that the monotonicity properties (l3|) play no role in our proof of the 
convergence. Except structural conditions concerning the Lipschitz property of the coefficients, 
the arguments work under the sole assumption that the master equation has a classical solution. 


Conclusion and further prospects. The fact that the existence of a classical solution to 
the master equation suffices to prove the convergence of the Nash system demonstrates the 
deep interest of the master equation, when regarded as a mathematical concept in its own right. 
Considering the problem from a more abstract point of view, the master equation indeed captures 
the evolution of the time-dependent semi-group generated by the Markov process formed, on 
the space of probability measures, by the forward component of the MEG system (|10ll . Such a 
semi-group is said to be lifted as the corresponding Markov process has 'P(T‘^) as state space. 
In other words, the master equation is a nonlinear PDE driven by a Markov generator acting 
on functions defined on V{T‘^). The general contribution of our paper is thus to show that any 
classical solution to the master equation accommodates with a given perturbation of the lifted 
semi-group and that the information enclosed in such a classical solution suffices to determine the 
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distance between the semi-group and its perturbation. Obviously, as a perturbation of a semi¬ 
group on the space of probability measures, we are here thinking of a system of N interacting 
particles, exactly as that formed by the Nash equilibrium of an N—player game. 

Identifying the master equation with a nonlinear PDE driven by the Markov generator of a 
lifted semi-group is a key observation. As already pointed out, the Markov generator is precisely 
the operator, acting on functions from V{T'^) to M, generated by the forward component of the 
MFG system (llOh . Put it differently, the law of the forward component of the MFG system 
m, which lives in P(P(T'^)), satisfies a forward Kolmogorov equation, also referred to as a 
“master equation” in physics. This says that “our master equation” is somehow the dual (in the 
sense that it is driven by the adjoint operator) of the “master equation” that would describe, 
according to the terminology used in physics, the law of the Nash equilibrium for a game with 
infinitely many players (in which case the Nash equilibrium itself is a distribution). We stress 
that this interpretation is very close to the point of view developed by Mischler and Mouhot |56j 
in order to investigate Kac’s program (up to the difference that, differently from ours, Mischler 
and Mouhot’s work investigates uniform propagation of chaos over an infinite time horizon; we 
refer to the companion paper by Mischler, Mouhot and Wennberg [57] for the analysis, based 
on the same technology, of mean-field models in finite time). Therein, the authors introduce 
the evolution equation satisfied by the {lifted) semi-group, acting on functions from P(M'^) to 
M, generated by the d-dimensional Boltzmann equation. According to our terminology, such an 
evolution equation is a “master equation” on the space of probability measures, but it is linear 
and of the first-order while ours is nonlinear and of the second-order (meaning second-order on 
P(T^)). 

In this perspective, we also emphasize that our strategy for proving the convergence of 
the AI—Nash system relies on a similar idea to that used in [56] to establish the convergence of 
Kac’s jump process. While our approach consists in inserting the solution of the master equation 
into the N—Nash system, Mischler and Mouhot’s point of view is to compare the semi-group 
generated by the AI—particle Kac’s jump process, which operates on symmetric functions from 
(]^(i)Af equivalently on empirical distributions of size N), with the limiting lifted semi¬ 

group, when acting on the same class of symmetric functions from (M'’*)'^ to M. Glearly, the 
philosophy is the same, except that, in our paper, the “limiting master equation” is nonlinear 
and of the second-order (which renders the analysis more difficult) and is set over a finite time 
horizon only (which does not ask for uniform in time estimates). It is worth mentioning that 
similar ideas have been explored by Kolokoltsov in the monograph [39] and developed, in the 
McKean-Vlasov framework, in the subsequent works [iQ] and [H] in collaboration with his 
coauthors. 

Of course, these parallels raise interesting questions, but we refrain from comparing these 
different works in a more detailed way: This would require to address more technical questions 
regarding, for instance, the topology used on the space of probability measures and the regularity 
of the various objects in hand; clearly, this would distract us from our original objective. We 
thus feel better to keep the discussion at an informal level and to postpone a more careful 
comparison to future works on the subject. 

We complete the introduction by pointing out possible generalizations of our results. For 
simplicity of notation, we work in the autonomous case, but the results remain unchanged if H or 
F are time-dependent provided that the coefficients F, G and H, and their derivatives (whenever 
they exist), are continuous in time and that the various quantitative assumptions we put on F, 
G and H hold uniformly with respect to the time variable. We can also remove the monotonicity 
condition d?]) provided that the time horizon T is assumed to be small enough. The reason is 
that the analysis of the smoothness of U relies on the solvability and stability properties of the 
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forward-backward system (jlOjl and of its linearized version: As for finite-dimensional two-point 
boundary value problems, Lipschitz type conditions on the coefficients (and on their derivatives 
since we are also dealing with the linearized version) are sufficient whenever T is small enough. 

As already mentioned, we also chose to work in the periodic framework. We expect for similar 
results under other type boundary conditions, like the entire space or Neumann boundary 
conditions. 

Notice also that our results can be generalized without much difficulty to the stationary 
setting, corresponding to infinite horizon problems. This framework is particularly meaningful 
for economic applications. In this setting the Nash system takes the form 

" N N 

rv^’\x) - 2 A^.v^’^x) - /3 Yi + H{xi,D^^v^’\x)) 

+ Y DpH{xj,D^.v^’^{x)) • D,,^v^\x) = F^\x) in 

where r > 0 is interpreted as a discount factor. The corresponding master equation is 

^ rU -{l + ^)A^U + H{x,D,,U) 

-(1 + 13) [ divy[DmU] dm(y) + f D^U ■ DpH(y,D^U) dm(y) 

< jRd JR‘i 

-2(3 I diVa; [DmU] dm(y) - (3 [ Tr dm 0 dm = F{x, m) 

Jm3 J-^2d 

in R'^ X 7^(R'^), 

where the unknown is the map U = U(x,m). One can solve again this system by using the 
method of (infinite dimensional) characteristics, paying attention to the fact that these charac¬ 
teristics remain time-dependent. The MFG system with common noise takes the form (in which 
the unknown are now (ut,mt,Vt))'. 

dtut = {rut - (1 + P)Aut + H(x,Dut) - F(x,mt) - ^/^dYv{vt)]dt + vt • ^/WdWt 

in [0, -l-oo) X 

^ dtmt = [(1 -t (3)Amt + div(mtDpH{mt, Dut))]dt - div(mfs/^dWt), 

in [0, -too) X 

mo = mo in T'^, (ut)t bounded a.s. 

Organization of the paper. We present our main results in Section [2l where we also explain 
the notation, state the assumption and rigorously define the notion of derivative on the space of 
measures. The well-posedness of the master equation is proved in Section [3] when (3 = d. Unique 
solvability of the MFG system with common noise is discussed in Section 01 Results obtained 
in Section 0] are implemented in the next Section [5] to derive the existence of a classical solution 
to the master equation in the general case. The last section is devoted to the convergence of 
the Nash system. In appendix, we revisit the notion of derivative on the space of probability 
measures and discuss some useful auxiliary properties. 

1.2 Informal derivation of the master equation 

Before stating our main results, it is worthwhile explaining the meaning of the Nash system, the 
heuristic derivation of the master equation from the Nash system and its main properties. We 
hope that this (by no means rigorous) presentation might help the reader to be acquainted with 
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our notation and the main ideas of proof. To emphasize the informal aspect of the discussion, 
we state all the ideas in without bothering about the boundary issues (whereas in the rest 
of the paper we always work with periodic boundary conditions). 


1.2.1 The differential game 

The Nash system m arises in differential game theory. Differential games are just optimal 
control problems with many (here N) players. In this game, Player i (for i = 1,..., N) controls 
his state iXi^t)te[o,T] through his control iai^t)te[o,T]- The state iXi^t)te[o,T] evolves according to 
the stochastic differential equation (SDE) 

dXi^t = ai,tdt + \f2dBl + s/^dWt, Xt^ = Xi^. (12) 


Recall that the d-dimensional Brownian motions ((d?i)tg[o,T])i=i,..., 7 V and iWt)te[o,T] are indepen¬ 
dent, (Bj)ig[o,T] corresponding to the individual noise (or idiosyncratic noise) to player i and 
(Wt)te[o,T] being the common noise, which affects all the players. Controls 
are required to be progressively-measurable with respect to the filtration generated by all the 
noises. Given an initial condition xq = (xi^Oi • • • :Xn,o) £ for the whole system at time to, 

each player aims at minimizing the cost functional: 


r f-T 


(to,®o,(aj,)j=i,...,Ar) =E 


-Jto 


(L(Xi,„ ai^,) + F^^\Xs)) ds + G^'^{Xt) 


where Xt = (Xi,*,..., Xn,t) and where L : x ^ M, ^ M and ^ M 

are given Borel maps. If we assume that, for each player i, the other players are undistinguish- 
able, we can suppose that and take the form 




N,i\ 

X 


and G^’*(£c) = G{xi,mX'‘). 


In the above expressions, F, G : x V{W^) —> M, where F(]R'^) is the set of Borel measures on 

M'^. The Hamiltonian of the problem is related to L by the formula: 


\l{x,p) 6 X H{x,p) = sup {—a ■ p — L{x, a)} . 

asR'^ 


Let now (t'^’*)i=l,...,Ar be the solution to ([T|). By Ito’s formula, it is easy to check that (w^’*)i=l,...,Af 
corresponds to an optimal solution of the problem in the sense of Nash, i.e., a Nash equilibrium 
of the game. Namely, the feedback strategies 

{a*{t,x) := -DpH{xi,D^^v^’\t,x))).^^^ (13) 

provide a feedback Nash equilibrium for the game: 

V ’ (tO)®o) “ Ji (^0) ®0) (®j,.)j = l,...,A:') ^ Ji (^0; ®0) (bj,.)j^i) 

for any i 6 {1,... , X} and any control a,^., progressively-measurable with respect to the filtration 
generated by ((.Bj )j=i,...,Ar)ts[o,T] and {Wt)te[o,T]- In the left-hand side, a*, is an abusive notation 
for the process {,oi*{t,Xj^t))te[o,T]^ where (Xi^t, ■ ■ ■, X]\f^t)te[o,T] solves the system of SDEs (fT^ 
when aj^t is precisely given under the implicit form = ®|(LXj^t). Similarly, in the right- 
hand side, a*, for j =|= i, denotes {Ojit, Xj^t))te[o,T]i where {Xi^t, ■ ■ ■ i X]\f^t)te[o,T] now solves the 
system of SDEs (I12p for the given Oi^., the other {aj^t)ji\s being given under the implicit form 
Uj^t = Cijit^Xj^t)- In particular, system ([2]), in which all the players play the optimal feedback 
m, describes the dynamics of the optimal trajectories. 
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1.2.2 Derivatives in the space of measures 

In order to describe the limit of the maps let us introduce—in a completely informal 

manner—a notion of derivative in the space of measures A rigorous description of the 

notion of derivative used in this paper is given in section [221 

In the following discussion, we argue as if all the measures had a density. Let U : P(]R‘^) ^ M. 
Restricting the function U to the elements m of which have a density in and 

assuming that U is defined in a neighborhood O a of n we can use the 

Hilbert structure on We denote by the gradient of U in namely 

^{p){q) = l^J^^^(u{p + eq)-U{p)y peO, 

Of course, way can identify |^(p) with an element of Then, the duality product 

j^{p){q) reads as the inner product (|^(p),^)L 2 (Rd)- Similarly, we denote by the second 
order derivative of U (which can be identified with a symmetric bilinear form on L^(]R‘^)): 

+ P^O, q,q e 

We set, when possible, 

DmU{m,y) = Dy^{m,y), Dl^^U{m,-,y,y') = Dly,^{m,y,y'). (14) 

To explain the meaning of DmU, let us compute the action of a vector field on a measure m and 
the image by U. For a given vector field B : and m 6 P(M'^) absolutely continuous 

with a smooth density, let m{t) = m{x,t) be the solution to 

[ ^ + div{Bm) = 0 
( mo = m 

This expression directly gives 

J-[/(m(/i))|^^g = <^,-div(Hm)>i 2 (Kd) = DmU{m,y) ■ B{y) dm{y), (15) 

where we used an integration by parts in the last equality. 

Another way to understand these derivatives is to project the map U to the finite dimensional 
space (M'^)^ via the empirical measure: if ® = (xi,... ,xn) e , let := {l/N) Xiili ^xi 

and set u^{x) = U{m^). Then one can check the following relationships (see Proposition 16.11) : 
for any j e {1,... ,iV}, 

D^.u^{x) = ^DmU{m^,Xj), (16) 

= ^Dy [DmU] {m^,Xj) + ^D^^C/(m^,x^) (17) 

while, if j ¥= k, 

= ■^Dl^mU{m^,Xj,Xk). (18) 
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1.2.3 Formal asymptotic of the 

Provided that m has a unique solution, each for i = 1,... ,N, is symmetric with respect 
to permutations on {1,and, for i =|= j, the role played by x* in is the same as the 
role played by in (see Subsection I6.2ji . Therefore, it makes sense to expect, as limit as 
N —> +00, 

where U : [0,T] x x ^ M. Starting from this ansatz, our aim is now to provide 

heuristic arguments explaining why U should satisfy Q. The sense in which the 
actually converge to U is stated in Theorem 12.131 and the proof given in Section [6l 

The informal idea is to assume that is already of the form U{t, Xi,mx’^) and to plug this 
expression into the equation of the Nash equilibrium ([T]): the time derivative and the derivative 
with respect to Xi are understood in the usual sense, while the derivatives with respect to the 
other variables are computed by using the relations in the previous section. 

The terms and H{xi,Dx^v^’'^) easily become ^ and H{x, DxU). We omit for a while 

the second order terms and concentrate on the expression 

j^i 


Note that is just like DxU{t,Xj,mx’^)- In view of (fT6l) . 




J 1 

1 

N - 1 


DmU(t, Xi, nig. ’ , Xj ), 


and the sum over j is like an integration with respect to rrix’^- So we find, ignoring the difference 
between rrix’^ and rrix’^, 


Y,DpHixj,Dx^v^’^)-Dx^v^'^ 

ji^i 


/ 


DpH{y,DxU{t,m2^\y)) ■ DmU{t,Xi,m^^^\y)dTn^ 


N,i 


(y)- 


We now study the term ^ Ax^ As ~ AxU, we have to analyze the quantity 

j 

Ax^v^'^. In view of (fTTll , we expect 


2 ^ [^rnU] {t, Xi, m^'\ Xj) + 

j^i j^i 

- [ divy [DmU] {t, Xi,m^'\ y)dm^'\y) + ^ 

J-fd iV - i 


I > j^i 

[ Tr {t,Xi,m^’\y,y)dm^'\y), 

JTd 


where we can drop the last term since it is of order 1/N. 


Let us finally discuss the limit of the term 


k,l 


d2vN,i 

dxkdxi 


that we rewrite 


Ax,v 


N,i 


2VTr( 


k^i 


d dv^’\ 

dxi dxk ' 


k,l^i 


Q 2 vN,i 

dxkdxi 


(19) 
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The first term gives AxU. Using (|16p the second one becomes 


2ZlTr(—^—) ^ 


k^i 


'dxk dxi 


N - 


^2 / d\Wx[DrnU]{t,Xi,m^^'\y)drn‘^'\y). 
Jt>- 


Y 2 Tr [DxDmU] {t, Xi,m^’\ x^) 


k^i 


As for the last term in (jl9l) . we have by (IlSp : 

k^l^i kjl^i 

- [ [ ^[DlimU]{t,Xi,mx’\y,y')dmx’\y)dmx'\y). 

Jjd Jjd 

Collecting the above relations, we expect that the Nash system 


dv^’"' 

dt 




kl 


dxkdxi 


) + Hixi,Dxy'^) 


Y DpH{xj,Dx^v^’^) • Dx^’^ = F{xi,mY) 


v^'\T,x) = G{xi,m^’^) 


has for limit 


r dU ^ 

dt " 


U— divy[DmU]dm + H{x,m, DxU) 

jRd 

—/3 ( Aa;C/+ 2 / diVx[DmU]dm + / div^ [DmU] dm + / Tr dm ® dm 

\ jRd jRd jR2d 

+ [ DmU ■ DpH{y,DxU)dm{y) = F{x,m) 

jRd. 

U{T,x,m) = G{x,m). 


This is the master equation. Note that there are only two genuine approximations in the above 
computation. One is where we dropped the term of order 1/N in the computation of the sum 
Yjji^iAxjV^’^- The other one was at the very beginning, when we replaced DxU{t,Xj,mx'^) by 
DxU{t,Xj,mx’^)- This is again of order 1/N. 


1.2.4 The master equation and the MFG systems 

We complete this informal discussion by explaining the relationship between the master equation 
and the MFG systems. This relation plays a central role in the paper. It is indeed the corner¬ 
stone for constructing a solution to the master equation via a method of (infinite dimensional) 
characteristics. However, for pedagogical reasons, we here go the other way round: While, in 
the next sections, we start from the unique solvability of the system of characteristics to prove 
the existence of a classical solution to the master equation, we now assume for a while that the 
master equation has a classical solution and, from this solution, we construct a solution to the 
MFG system. 

Let us start with the first order case, i.e., when /3 = 0, since this is substantially easier. 
Let U be the solution to the master equation ([9]) and, for a fixed initial position (to,m'(o)) ^ 
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[0,T] X {u,m) be a solution of the MFG system ([6]) with initial condition m{to) = m(o)- 

We claim that 


( 20 ) 


dtm — Am — div(jnDpH (x, D^U{t, x, = 0, 

u{t,x) = U{t,x,m{t)), te[tQ,T]. 

In other words, to compute U{to,x,m(^o)), we just need to compute the solution {u,m) of the 
MFG system ([6]) and let U{to,x,m(^Q)) := u{to,x). This is exactly the method of proof of 
Theorem 12.81 

To check (I20p . we solve the McKean-Vlasov equation 

dtm' — Am' — di\ [m'DpH [x,DxU{t^x^m'{t)))^ =0, m'{tQ,-) = m(o), 

and set u'{t,x) = U{t,x,m'{t)). Then 

dtu'{t,x) = dtU + + (^^, Am' + div{m'DpH{-,D^U))'^^^ 


= dtU+ [ (divj, [DmU] - DmU • DpH{y, D^U))dm'{y) 
jRd- ^ ' 


( 21 ) 


= -AHJ+ H{x,D^U)-F{x,m) 

where we used the equation satisfied by U in the last equality. Therefore the pair {u',m') is a 
solution to ([6]), which, provided that the MFG system is at most uniquely solvable, shows that 
{u', m') = (u, m). 


For the second order master equation (/3 > 0) the same principle applies except that, now, 
the MFG system becomes stochastic. Let (to)"^'(o)) ^ [0)^1 ^ 'P(M'^) and {ut,mt) be a solution 
of the MFG system with common noise (llOp . Provided that the master equation has a classical 
solution, we claim that 

dtmt = ^{1 + l3)Amt + div(mtDpH(^x,D^U{t,x,mt))'^^dt +-y/^div^midWt), ^^ 2 ) 
ut{x) = U{t,x,mt), t e [to,T], a-s.. 

Once again, we stress that this formula (whose derivation here is informal) underpins the rigorous 
construction of the second order master equation performed in Section [5j As a matter of fact, 
it says that, in order to define U{tQ,x,m(^^) (meaning that U is no more a priori given as we 
assumed a few lines above), one “just needs” to solve the MFG system (fTU|l with mt^ = m(o) 
and then set U(tQ,x,m^Q)) = Here one faces the additional issue that, so far, there has 

not been any solvability result for ([8]) and that the regularity of the map U that is defined in 
this way is much more involved to investigate than in the first order case. 

Returning to the proof of (j22|) (and thus assuming again that the master equation has a 
classical solution), the argument is the same in the case /? = 0, but with extra terms coming 
from the stochastic contributions. First, we (uniquely) solve the stochastic McKean-Vlasov 
equation 

dtm'^ = |(1 -I- /?) Am( -I- div^m'^DpH (x, DxU{t, x, m())^ ^ dt + y/^div{m'^dWt), m'^^ = mo, 

and set u((x) = U{t,x,m't). Then, by Ito’s formula, 

dtu'tix) = l^tC/ + (^^,{1 +l3)Am't + diY{m'tDpH{-,D^U))y^^+p(^^^Dm't,Dm'ty^^^dt 

( 23 ) 
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In comparison with the first-order formula (j2ip . equation ()23l) involves two additional terms; 
The stochastic term on the second line derives directly from the Brownian part in the forward 
part of (jlOp whilst the second order term on the first line is reminiscent of the second order term 
that appears in the standard Ito calculus. We provide a rigorous proof of ([231) in Section [5l 
Using (fT4]l . we obtain 

dtu[{x) = [dtU + + ^)<XiYy[D^U]- D^U ■ DpH{-,D^U)ymt 

+13 [ TT[D‘^^U]dm[0m[\dt 
+ ( [ DmUdm[) • ^/^dWt 
Taking into account the equation satisfied by U, we get 

diu'(x) = + + -2/3 / dw,[DmU]dm[-F]dt 

jRd ) 

+ ( [ DmUdm[) ■ ^^dWt 

= {-(1 + /3)A^[/ + D^U) - 2/3div(u0 - F]dt + v[ ■ ^J^dWt 


Jw 


for v[:= / DraUdm[. 


This proves that {u[,m[,v^) is a solution to the MFG system (fTOjl and, provided that the 
MFG system is at most uniquely solvable, proves the claim. 
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2 Main results 


In this section we collect our main results. We first state the notation used in the paper, specify 
the notion of derivatives in the space of measures, and describe the assumptions on the data. 


2.1 Notations 


Throughout the paper, denotes the d—dimensional euclidean space, with norm | • |, the scalar 
product between two vector a,b e being written a • b. We work in the d—dimensional torus 
(i.e., periodic boundary conditions) that we denote := When is a (large) integer, 

we use bold symbols for elements of (T'^)'^: for instance, x = (xi, ... ,xn) £ (T'^)^. 

The set V{T'^) of Borel probability measures on is endowed with the Monge-Kantorovich 
distance 

di(m,m') = sup / (j){y) d{m — m'){y), 

(/> Jrd 

where the supremum is taken over all Lipschitz continuous maps (() : ^ R with a Lipschitz 

constant bounded by 1. Let us recall that this distance metricizes the weak convergence of 
measures. If m belongs to V{T'^) and 0 : ^ is a Borel map, then denotes the push- 

forward of m by 0, i.e., the Borel probability measure such that [())(jm](^) = m{(l)~^{A)) for 
any Borel set yl When the probability measure m is absolutely continuous with respect 

to the Lebesgue measure, we use the same letter m to denote its density. Namely, we write 
m 3 X ^ m{x) e R+. Besides we often consider flows of time dependent measures of the 
form (m(t))tg[o,T]) with m{t) e V{T^) for any t e [0,T]. When, at each time t e [0,T], m{t) 
is absolutely continuous with respect to the Lebesgue measure on T*^, we identify m{t) with its 
density and we sometimes denote by m : [0, T] x 9 (t, x) m{t, x) e R+ the collection of the 
densities. In all the examples considered below, such an m has a time-space continuous version 
and, implicitly, we identify m with it. 

If (() : —> R is sufficiently smooth and £ = (£i,... ,£d) e N*^, then stands for the 

derivative -^-j- ... The order of derivation is denoted by \£\. Given e e R“, 

we also denote by decj) the directional derivative of cj) in the direction e. For n e N and a e (0,1), 
^n+a -g gg^ q£ fQj. which D^(f) is defined and a—Holder continuous for any £ e with 
\£\ ^ n. We set 


<(*||n+Q: 


2 snp\D^<p{x)\+ 2 sup 


D^(j){x) - D^(l){x')\ 



The dual space of is denoted by (C"'+“)' with norm 
Vpe(C”+")', ||p||_(„+^) := sup 

||0||n + Q;^l 


(P: 


n + a . 


If a smooth map xjj depends on two space variables, e.g. xjj = i^{x, y), and m, n e N are the order 
of derivation of V’ with respect to x and y respectively, we set 

liV’IlKn) := 2 


and, if moreover the derivatives are Holder continuous 
II V’||(m+Q:,n+a) * 


, W „„„ \D^^’^'"> 4 >{x,y) - ,y')\ 

(m,n) / j sup I /la I I /la 

|x-xr + |y-yr 


|^|=m,|£'|=n ' 
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The notation is generalized in an obvious way to mappings depending on 3 or more variables. 

If now the (sufficiently smooth) map (p depends on time and space, i.e., p = (p{t,x), we say 
that (p e (where I = n + a, n e N, a e (0,1)) if D^Dlp exists for any 6 N'^ and j € N with 
\l\ + 2j ^ n and is a—Holder in x and a/2—Holder in t. We set 

\\4>\\n/2+a/2 ,n+a ■ s 

|■^|+ 2 _ 7 ^n \i\+2j^n 


with 


(D^Dlpy^^a ■= sup 

t,x^x' 


\4>(t,x) - 4>(t,x')\ 


X — X 


{D^Dl4>)t,a := sup 


\p(t,x) - (t)(t',x)\ 


,X 


t-t' 


If X,Y are a random variables on a probability space (11,^, P), C(X) is the law of X and 
C(Y\X) is the conditional law of Y given X. Recall that, whenever X and Y take values in Polish 
spaces (say Sx and Sy respectively), we can always find a regular version of the conditional law 
C{Y\X), that is a mapping q : Sx x B(Sy) [0,1] such that: 

• for each x e Sx, g(x,-) is a probability measure on Sy equipped with its Borel u-field 
B(Sy), 

• for any A e B(Sy), the mapping Sx b x <—>■ q(x. A) is Borel measurable, 

• q{X, •) is a version of the conditional law of X given Y, in the sense that 


^[f{X,Y)]= [ (f f{x,y)q{x,dy)'\d{£{X)){x) =E f f{X,y)q{X,dy) 
J Sx V*' / L*' Sy 


for any bounded Borel measurable mapping / : Sx x Sy —> M. 


2.2 Derivatives 

One of the striking features of the master equation is that it involves derivatives of the unknown 
with respect to the measure. In the paper, we use two notions of derivatives. The first one, 
denoted by is, roughly speaking, the L? derivative when one looks at the restriction of ViT^) 
to densities in L^(T'^). It is widely used in linearization procedures. The second one, denoted 
by DmU, is more intrinsic and is related with the so-called Wasserstein metric on P(T'^). It can 
be introduced as in Ambrosio, Gigli and Savare [3j by defining a kind of manifold structure on 
P(T'^) or, as in Lions [52], by embedding P(T'^) into an L^(II,T'^) space of random variables. 
We introduce this notion here in a slightly different way, as the derivative in space of |^. In 
appendix we briefly compare the different notions. 


2.2.1 First order derivatives 

Definition 2.1. We say that U : P(T'^) ^ 
P(T'^) X ^ M such that, for any m,m' e Viff^), 

U ((1 — s)m + sm') — U (m) 

hm - = 

s^ 0 + s 


is if there exists a continuous map 


6m 


r 6u 

Ijd 6m 


(m, y)d(m' — m)(y). 


Note that is defined up to an additive constant. We adopt the normalization convention 



( 24 ) 
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For any m e P(T^) and any signed measure /x on T^, we will use indifferently the notations 

6u . r SU . 


— (m)(^) and / —{m,y)dn{y). 


6m 


Ijd 

Note also that 


6m 


Vm, m' e V{T^), U{m') - U{m) = 



0 


6m 


((1 — s)m + sm', y) d{m' — m){y)ds. (25) 


Let us explain the relationship between the derivative in the above sense and the Lipschitz 

SU SU 

continuity of U in If -r— = -— (m,y) is Lipschitz continuous with respect to the second 

6m 6m 

variable with a Lipschitz constant bounded independently of m, then U is Lipschitz continuous: 
indeed, by 


\U{m')-U{m)\ ^ 

^ sup 


f 


SU 

Dy-—((l — s)m + sm',-) ds di(m,m') 


This leads us to define the “intrinsic derivative” of U. 

SU 

Definition 2.2. If -— is of class with respect to the second variable, the intrinsic derivative 
6m 

DmU : 'P{T^) X is defined by 

DmU{m,y) := Dy—{m,y) 

The expression D^U can be understood as a derivative of U along vector fields: 

Proposition 2.3. Assume that U is , with with respect to y and DmU is continuous 

in both variables. Let (/> : T'^ —> be a Borel measurable and bounded vector field. Then 


lim 

h^O 


U{{id + h(j))‘^m) — U{m) 
h 


/ DmU{m,y) ■ (j){y) dm{y). 

JTd- 

Proof. Let us set mh,s ■= s{id + h(j))'^m + (1 — s)m. Then 

n 6U 

7 — {nT'h s, y)d{{id + h(j))‘^m — m){y)ds 
•d 6m 

f SU^ su^ 

= / / {^{^h,s,y + h(t){y)) -—{mh,s,y))dm{y)ds 

Jo JTd 6m^ dm 

= h / DmU{mh,s,y+ th(f){y)) ■ (j){y) dtdm{y)ds. 

Jo JT'i Jo 

Dividing by h and letting h ^ 0 gives the result thanks to the continuity of D^U. 


□ 


Note also that, if U : P(T'^) —> M and is in y, then DyDjnU{m,y) is a symmetric 
matrix since 

( SU\ SU 

= HesSy — (m,y). 
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2.2.2 Second order derivatives. 


If, for a fixed y e T'^, the map m >—>■ - — (m, y) is C^, then we say that U is and denote by -—^ 

dm dm^ 

its derivative. (Pay attention that y is fixed. At this stage, nothing is said about the smoothness 

d^U 


in the direction y.) By Definition 12. II we have that 


dm‘^ 


: V{r) X X T'" 


with 


dm 


{m',y) - ^{m,y) = [ [ - s)m + sm',y,y') d{m'- m){y'). 

dm Jq J^d dm^ 


d‘^U d‘^U 

If U is and if ^—rr = t— -{m,y,y') is in the variables {y,y'), then we set 


(5m^ (5m^ 


d^U 


DmmU{m,y,y) := Dy^y,j^{m,y,y'). 


We note that : V{T‘^) x x ^ The next statement asserts that -—^ enjoys 

the classical symmetries of second order derivatives. 


d^U 


Lemma 2.4. Assume that 


d^U 

dm‘^ 


is jointly continuous in all the variables. Then 


^-^{m,y,y') = ^-^{m,y',y), m6p(T'^), y,y'6T'^. 

om^ dm^ 

fy yy 

In the same way, if -— is in the variable y and -—^ is also in the variable y, Dy -—^ 
dm dm^ dm^ 

being jointly continuous in all the variables, then, for any fixed y e the map m ^ DmU{m,y) 
is and ^ 

^ ^;{m,y,y') = ^{DmU{m,y)){y'), me'P{T'^), y,y'eT‘^, 

dm 


A 


dm^ 


while, if-—K is alsoC^ in the variables {y,y'), then, for any fixedy e T'^, the map - — (DmU{-,y)) 
dm^ dm 

is in the variable y' and 


Dm{DmU{-,y)){m,y') = Dl,^U{m,y,y'). 


Proof. First step. We start with the proof of the first claim. By continuity, we just need to show 
the result when m has a smooth positive density. Let pi,v e L®(T^), such that h = fjd n = 0, 
with a small enough norm so that m + is a probability measure for any (s, t) e [0,1]^. 

Since C/ is A, the mapping U : [0, 1]^ e (s,t) ^ F(m + sfi + nn) is twice differentiable and, 
by standard Schwarz’ Theorem, DtDsU{s,t) = DsDtU{s,t), for any {s,t) e [0,1]^. Notice that 


DtDg 

DsDv 


U{s,t) = + stx + tv,y,y')n{y)v{y')dydy' 

U{s,t) = f 
in 


d^U 

[Td]2 (5m^ 


(m + s/x + tv, y', y)p.{y)v{y')dydy'. 


Choosing s = t = 0, the first claim easily follows. 

Seeond step. The proof is the same for the second assertion, except that now we have to 

su 

consider the mapping U' : [0,1] x 3 (t, y) i—> -—(m + tp, y), for a general probability measure 

dm 
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m e V{T'^) and a general finite signed measure jj, on such that = 0 and m + ;U is a 

probability measure. (In particular, m + tfj, = (1 — t)m + t{m + is also a probability measure 
for any t 6 [0,1].) By assumption, Li' is in each variable t and y with 

DtU'{t, y) = y)dKy')^ DyU'{t, y) = DmU{m + ty, y). 

In particular, DtW is in y and 

6^U 


DyDtU'{t,y) = / ^Dyj^{m + ti^,y,y')y{y')dy' 


I'Id 

By assumption, DyDtU' is jointly continuous and, by standard Schwarz’ Theorem, the mapping 
DyW is differentiable in t, with 

Dt{DyU'){t,y) = Dt{DmU {m + ty,y)) = J ^Dy^^{m + tfj,,y,y')y{y')dy'. 
Integrating in t, this shows that 

n s'^u 

y^ y')y{y')dy'dt. 

Choosing y = m' — m, for another probability measure m' e P(T'^) and noticing that (see 
Remark 12.51 below): 

6‘^U 


f D 

JTd 


^ dm? 


[m,y,y')dm{y') = 0, 


we complete the proof of the second claim. 

For the last assertion, one just need to take the derivative in y in the second one. □ 


Remark 2.5. Owing to the convention (1241) . we have 

r 6^u 


VyeT'', , 

Jjd 

when U is C"^. By symmetry, we also have 


{m,y,y')dm{y') = 0, 


Vy'eT"*, f ^'^{m,y,y')dm{y) = 0. 

Jjd dm^ 


And, of course. 


f[Td]: 


dm^ 


m,y,y')dm{y)dm{y') = 0. 


2.2.3 Comments on the notions of derivatives 

Since several concepts of derivatives have been used in the mean field game theory, we now 
discuss the link between these notions. For simplicity, we argue as if our state space was M'’* and 
not T^, since most results have been stated in this context. (We refer to the Appendix for an 
exposition on T^.) 

A first idea consists in looking at the restriction of the map U to the subset of measures with 
a density which is in L^(]R‘’*), and take the derivative of U in the L^(M'’*) sense. This is partially 
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the point of view adopted by Lions in [52] and followed by Bensoussan, Frehse and Yam 


su 

In the context of smooth densities, this is closely related to our first and second derivatives — 

dm 

and -— 7T. 

Many works on mean field games (as in Buckdahn, Li, Peng and Rainer |16] . Carmona and 
Delarue m, Chassagneux, Crisan and Delarue m, Gangbo and Swiech [28]) make use of an 
idea introduced by Lions in [52]. It consists in working in a sufficiently large probability space 
(D,^, P) and in looking at maps U : ^ M through their lifting to L^(D, P; M'^) defined 

by 

UiX) = U{C{X)) ^X e 

where C{X) is the law of X. It is clear that the derivative of U —if it exists—enjoys special 
properties because U{X) depends only on the law of X and not on the full random variable. As 
explained in [52], if U is differentiable at some point Xq e L^(D, P; R'^), then its gradient can 
be written as 

VU{Xo) = d^U{C{Xo)){Xo), 

where 5^17 : R(R'^) x R'^ 3 (m,x) d^U{m){x) e R'^. We explain in the Appendix that the 
maps d^U and DmU introduced in Definition 12.21 coincide, as soon as one of the two derivatives 
exists. Let us also underline that this concept of derivative is closely related with the notion 
introduced by Ambrosio, Gigli and Savare [3] in a more general setting. 


2.3 Assumptions 

Throughout the paper, we assume that H : x R'^ 


and satisfies the coercivity condition: 

1—1 , p ,2 


c- 


1 + \p\ 


R is smooth, globally Lipschitz continuous 
^ D'^ H{x,p) ^Cld for (x,p) 6 T'^ X R'^. (26) 


We also always assume that the maps F,G x V{T'^) 
and monotone: for any m,m' e R(T'^), 


are globally Lipschitz continuous 


/ {F{x,m) — F{x,m'))d{m — m'){x) ^ 0, / {G{x,m) — G{x,m'))d{m — m'){x) ^ 0. (27) 

Note that assumption ([Tf|) implies that and satisfy the following monotonicity property 
(explained for F): 

6F 


/ / 

jjd jjd 


-{x,m,y)p{x)p{y)dxdy ^ 0 


I'fd Sm 

for any centered measure p. Throughout the paper the conditions (I26|) and (1271) are in force. 


Next we describe assumptions that might differ according to the results. Let us fix n 6 N 
and a e (0,1). We set (with the notation introduced in subsection 12.ip 


6F 

LiPn(^) := sup (di(mi,m 2 )) 

0771 1711^1712 


-1 


6F 6F 

— (•,mi,-) - —(•,m2,-) 

dm dm 


(n+Q:,n+a) 

and use the symmetric notation for G. We call (HFl(n)) the following regularity conditions 
on F: 


(HFl(n)) 


sup 

mGp(T^) 


||F(-, m)\ 


n+CK 


6F{-,m,-) 


6m 




+ < CO, 
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and (HGl(n)) the symmetric condition on G: 


(HGl(n)) 


sup 


||G(-,m) 




In+Q 


6m 


(n+ci,n+Q:) j 

We use similar notation when dealing with second order derivatives: 


T- 

+ np„(^) < CO. 


LiPnClZ:^) := sup (di(mi,m2)) mi, m2, •, ') 






6^F 
5m^ * 


(ri.+Q:,n+Q,n+Q;) 


and call (HF2(n)) (respectively (HG2(n))) the second order regularity conditions on F: 


(HF2(n)) sup ||cP(-,m)||„+„ 

mGp(T'^) \ 


and on G\ 


6F{;m, •) 


6m 


sup 

me'P(T‘^) 


6^Fi;m,-r) 


(n+Q:,n+Q;) 


6m^ 


^6^F, 


(n+Q:,n+a,n+Q) 




(HG2(n)) sup ||G(-,m)||„+^ 

ms'P(T'i) \ 


6G{-,m, •) 


• sup 

mG'P{T^) 


6‘^G{-,m,-,-) 


6m? 


(n+a,n+a)y 

+ LiPn(^) < 00- 

(n+a,n+Q:,n+a) 


Example 2.6. Assume that F is of the form: 

F{x,m) = / ^{z, {p * m){z))p{x — z)dz, 

jR<i 


where * denotes the usual convolution product (in W^) and where $ : —> M is a smooth map 
which is nondecreasing with respect to the second variable and p is a smooth, even function with 
compact support. Then F satisfies the monotonicity condition (1271) as well as the regularity 
conditions (HFl(n)) and (HF2(n)) for any n e N. 


Proof. Let us first note that, for any m,m' e 


/Td 


(F(x, m) — F(x, m'))d{m — m'){x) 


/Td 


[$(y, p * m{y)) - ^{y, p * m'{y))\ (p * m{y) - p * m'{y)) dy ^ 0, 


since p is even and is nondecreasing with respect to the second variable. So F is monotone. 
Writing $ = <l>(x, 0), the derivatives of F are given by 


6m 


{x,m,y) = f ^{z,p*m{z))p{x - z)p{z - y)dz 
jKd de 


and 


6^F 

6m? 


{x,m,y,y') = -^{z, p m{z))p{z - y)p{z - y')p{x - z)dz. 


Then (HFl(n)) and (HF2(n)) hold because of the smoothness of p. 


□ 
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2.4 Statement of the main results 

The paper contains two main results: on the one hand the well-posedness of the master equation, 
and, on the other hand, the convergence of the Nash system with N players as N tends to 
infinity. We start by considering the first order master equation (/3 = 0), because, in this 
setting, the approach is relatively simple fTheorem 12.81) . In order to handle the second order 
master equation, we build solutions to the mean field game system with common noise, which 
play the role of “characteristics” for the master equation (Theorem 12.9|) . Our first main result 
is Theorem 12.111 which states that the master equation has a unique classical solution under 
our regularity and monotonicity assumptions on H, F and G. Once we know that the master 
equation has a solution, we can use this solution to build approximate solutions for the Nash 
system with players. This yields to our main convergence results, either in term of functional 
terms fTheorem 12.13p or in term of optimal trajectories fTheorem 12.151) . 

2.4.1 First order master equation 

We first consider the first order master equation (or master equation without common noise): 

—dtU{t,x,m) — AxU{t,x,m) + H{x,DxU{t,x,m)) — / divy [DmU] {t,x,m,y) dm{y) 

JTi 

+ / DmU{t,y,m,y) ■ DpH{y,D^U{t,y,m)) dm{y) = F{x,m), 

" JTd. 

in [0,r] X X T’(T“'), 

U{T,x,m) = G{x, m) in x . 

(28) 

We call it the first order master equation since it only contains first order derivatives with respect 
to the measure variable. Let us first explain the notion of solution. 

Definition 2.7. We say that a map U : [0,T] x x ViT'^) is a classical solution to the 
first order master equation if 

• U is continuous in all its arguments (for the di distance on V{T'^)), is of class in x 
and in time (the derivatives of order one in time and space and of order two in space 
being continuous in all the arguments), 

• U is of class with respect to m, the first order derivative 

su 

[0,T] X X V(T'^) X 9 (t,x,m,y) -— (t,x,m,y), 

dm 

being continuous in all the arguments, 6Uf6m being twice differentiable iny, the derivatives 
being continuous in all the arguments, 

• U satisfies the master equation (f28]l . 

Theorem 2.8. Assume that F, G and H satisfy (1261) and (1271) in Subsection 1^.31 and that 
(HFl(n+l)) and (HGl(n+2)) hold for some n ^ 1 and some a e (0,1). Then the first order 
master equation (fTHI) has a unique solution. 

Moreover, U is (in all variables), is continuous in all variables and U{t,-,m) and 
|^(t,-,m, •) are bounded in and x ^”-+1+“ respectively, independently of{t,m). 
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Finally, is Lipschitz continuous with respect to the measure variable: 


sup sup (di(mi,m 2 )) ^ 
ts[ 0 ,r] mi^m 2 


5U, 


5U, 


(n+2+Q;,n+Q;) 


< 00 . 


Section [3] is devoted to the proof of Theorem 12.81 We also discuss in this section the link 
between the solution U and the derivative of the solution of a Hamilton-Jacobi equation in the 
space of measure. 

The proof of Theorem 12.81 relies on the representation of the solution in terms of the mean 
field game system: for any {to, mo) e [0,r) x V{T'^), the MFG system is the system of forward- 
backward equations: 

r —dtu — Au + H{x,Du)=F{x,m{t)) 

< dtm — Am — div{mDpH{x, Du)) = 0 (29) 

[ u{T, x) = G{x, m(T)), m{to, ■) = mo 

As recalled below (Proposition 13.Ih . under suitable assumptions on the data, there exists a 
unique solution {u, m) to the above system. Our aim is to show that the map U defined by 


U{to,-,mo) := u{to,-) (30) 

is a solution to ((28l) . The starting point is the obvious remark that, for U defined by (l30|) and 
for any he [0,T — to], 

u{to + h,-) = U{to + h, ■,m{to + h)). 

Taking the derivative with respect to h and letting h = 0 shows that U satisfies (j28p . 

The main issue is to prove that the map U defined by (|30l) is sufficiently smooth to perform 
the above computation. In order to prove the differentiability of the map U, we use a flow 
method and differentiate the MFG system ()29p with respect to the measure argument mo- The 
derivative system then reads as a linearized system initialized with a signed measure. Fixing a 
solution (u,m) to (|29ll and allowing for a more singular initial distribution fig £ (C”'^^'''"(T'^))' 
(instead of a signed measure), the linearized system, with (u,/i) as unknown, takes the form: 

f 6F 

—dtv — Av + DpH{x, Du) ■ Dv = -— (x, m{t)) (/u(f)) 

< dt/a — A/a — dw(^/iDpH{x, Du)) — d\v{mD'i H{x, Du)Dv) = 0 
6G 

v{T,x) = —{x,m{T)){/i{T)), /i{to,-) = /lo- 

We prove that v can be interpreted as the directional derivative of U in the direction /lo- 

v{to,x) = [ ^{to,x,mo,y)/io{y)dy. 

J-^d dm 

Note that this shows at the same time the differentiability of U and the regularity of its derivative. 
For this reason the introduction of the directional derivative appears extremely useful in this 
context. 


2.4.2 The mean field game system with common noise 

As explained in the previous subsection, the characteristics of the first order master equation (j28p 
are the solution to the mean field game system (j29p . The analogous construction for the second 
order master equation (with /? > 0) yields to a system of stochastic partial differential equations, 
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the mean field game system with common noise. Given an initial distribution mo e P(T'^) at an 
initial time to £ [0,?"], this system read^ 

f dtut = {-(1 + + H{x,Dut) - F{x,mt) - 2^div(ut)}dt + vt ■ \^dWt, 

< dtmt = [(1 + (3)Amt + div[mtDpH{mt, Dut))]dt - ^/ifidiv{mtdWt), in [to,r] x T'^, 

[ m^Q = mo, ut{x) = G{x,mT) in T'^. 

(31) 

Here {Wt)te[o,T] is a given d—dimensional Brownian motion, generating a filtration (-^t)te[o,T]- 
The solution is the process {ut,mt,vt)te[o^T]-: adapted to where, for each t e [to,T], 

Vt is a vector field which ensures the solution (ut) to the backward equation to be adapted to the 
filtration {Ft)te[to,T]- Up to now, the well-posedness of this system has never been investigated, 
but it is reminiscent of the theory of forward-backward stochastic differential equations in finite 
dimension, see for instance the monograph |59] . 

To analyze (I3ip . we take advantage of the additive structure of the common noise and perform 
the (formal) change of variable 

ut{x) = ut{x + s/^Wt), mfix) = mt{x + y^Wt), xeT'^, te[0,r]. 

Setting ^i(x,p) = H{x + \/2Wt,p), Ft{x,m) = F{x + \/2Wt,m) and Gt{x,m) = G{x + \^Wt,m) 
and invoking the Ito-Wentzell formula (see Section 0] for a more precise account), the pair 
{ut,mt)te[to,T] formally satisfies the system 

( dtUt = {-Aut + Ht{-, Dut) - Ft{-,mt)]dt + dMt, 

I dtmt = {Arht + div{mtDpHt{-,Dut))}dt, (32) 

[ rhto = mo, UT = G{-,mT). 

where (still formally) dMt = vfix -I- ■\/2Wt)dWt. 

Let us explain how we understand the above system. The solution {ut)te[o,T] is seen as an 
(•^t)is[o,T]-adapted process with paths in the space C'^([0,T],C"'''‘^(T'^)), for some fixed n ^ 
0. The process imt)te[o,T] reads as an (Tt)^g[o,T]-adapted process with paths in the space 
C°([0,T],'P(T'^)). We shall look for solutions satisfying 

sup (||tttj|n+ 2 +a) e L®(H,^,P), (33) 

is[0,T] 

(for some fixed a e (0,1)). The process (Mt)tg[o,T] is seen as an (Ti) 4 g[o,T]-adapted process with 
paths in the space C°([0,T],C”(T'^)), such that, for any x e (Mt(x))tG[o,T] is an (Tt) 4 g[o,T] 
martingale. It is required to satisfy 

sup (iMtlU+a) e L^(Q,A,F). (34) 

te[0,T] 

Theorem 2.9. Assume that F, G and H satisfy ()26l) and (|27p and that (HFl(n+l)) and 
(HGl(n+2)) hold true for some n ^ 0 and some a e (0,1). Then, there exists a unique 
solution {ut,mt, Mt)te[o,T] to (1321) . satisfying (l33P and (IMp . 

We postpone the discussion of the existence of the solution to the true MFG system with 
common noise (I3ip to the next section, where the master equation allows to identify the correc¬ 
tion term (ut)tg[o,T]- 

^ In order to emphasize the random nature of the functions u and m, the time variable is now indicated as an 
index, as often done in the theory of stochastic processes. 
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Theorem 12.91 is proved in section 0] (see Theorem 14.31 for more precise estimates). The main 
difference with the deterministic mean field game system is that the solution (ut, mt)os;ts;r 
is sought in a much bigger space, namely [C°([0,T],C"'(T'^)) x C°([0,T],P(T'^))]^, which is 
not well-suited to the use of compactness arguments. Because of that, one can can no longer 
invoke Schauder’s Theorem to prove the existence of a solution. For this reason, the proof 
uses instead a continuation method, directly inspired from the literature on finite dimensional 
forward-backward stochastic systems (see [M])- Notice also that, due to the presence of the 
noise iWt)te[(},T]j the analysis of the time-regularity of the solution becomes a challenging issue 
and that the continuation method permits to bypass this difficulty. 


2.4.3 Second order master equation 

The second main result of the paper concerns the analogue of Theorem 12.81 when the underlying 
mean-field game problem incorporates an additive common noise. Then the master equation 
(I28p then involves additional terms, including second order derivatives in the direction of the 
measure. It has the form (for some fixed level of common noise /3 > 0): 


—dtU (t, X, m) — {1 + (3)AxU{t, x,m) + H (x, DxU{t, x, m)) — T(x, m) 

-{l + (3) divy[DjnU]{t,x,m,y)dm{y) + / DmU{t,x,m,y) ■ DpH{y,DxU{t,y,m))dm{y) 


Ijd 


Tr 


iTdxjd 


-2/3 [ diVx[DmU](t,x,m,y)dm{y) -/3 [ 

for {t,x,m) e [0,T] x x r{T'^) 
U{T,x,m) = G{x,m), for (x,m) 6 x T’(T'^). 

Following Definition 12.71 we let 


DmmU{t,x,m,y,y') dm{y)dm{y') = 0, 


(35) 


Definition 2.10. We say that a map U : [0,T] x x V{T'^) is a classical solution to the 
second order master equation (I35|) if 

• U is continuous in all its arguments (for the di distance on V{T'^)), is of class in x 
and in time (the derivatives of order one in time and space and of order two in space 
being continuous in all the arguments), 

• U is of class with respect to m, the first and second order derivatives 

err 

[0,T] X X ViT^) X 3 {t,x,m,y) >—>■ - — {t,x,m,y), 

dm 

5‘^U 

[0,r] X T"* X T’(T‘^) X X T'^ 9 {t,x,m,y,y') ^ j^{t,x,m,y), 

being continuous in all the arguments, the first order derivative 6Uf6m being twice differ¬ 
entiable in y, the derivatives being continuous in all the arguments, and the second order 
derivative jdmf being also twice differentiable in the pair {y,y'), the derivatives being 

continuous in all the arguments, 

• the function Dy{6U/5m) = Dmll is differentiable in x, the derivatives being continuous in 
all the arguments, 

• U satisfies the master equation ([35]). 
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On the model of Theorem 12.81 we claim 


Theorem 2.11. Assume that F, G and H satisfy (IMj) and dSiD in Subsection I £.91 and that 
(HF2(n+l)) and (HG2(n+2)) hold true for some n ^ 2 and for some a e (0,1). 

Then, the second-order master equation (l85]l has a unique solution U. 

The solution U enjoys the following regularity: for any a' e [0,0;), t e [0,T] and m e 
U{t,-,m), [6U/6m]{t,-,m,-) and •, m, •, •) are in , ^"■+2+“' x c^+^+F and 

Qn+ 2 +oi ^ Qn+a ^ Qn+a respectively, independently of {t,m). Moreover, the mappings 

[0, T] X nr^^) 3 {t, m) ^ U{t, ■,m) e 

[0, T] X ViT^^) 3 {t, m) ^ [6U/5m]{t, •, m, •) e C”+2+«' x C”+i+“', 

[0, T] X 9 {t, m) ^ [6^U/6m^]{t, ■,m, ■, •) 6 C”+2+“' x [C”+“']2 


are continuous. When a' = 0, these mappings are Lipschitz continuous in m, uniformly in time. 

Section [5] is devoted to the proof of Theorem 12.111 As for the first order master equation, 
the starting point consists in letting, given (to, mg) e [0,T] x V(T‘^), 

U(to,x,mo) = uto(x), xeT'^, 

where (ut,rht, Mt)te[o,T] is the solution to the mean field game system with common noise (1321) . 
when (lTi)ig[o,T] iii the definition of the coefficients F, G and H is replaced by iWt — WtQ)te[tQ,T]- 
The key remark (see Lemma EU), is that, if we let mt^^t = [Id + \f2{Wt — Wtoff^frit, then, for 
any /i e [0, T — to], IP almost surely, 

utf,+h(x) = U(to + h,x + \/2(WtQ+h - Wto),mtQ^to+h), x 6 T'^. 


Taking the derivative with respect to at /i = 0 on both sides of the equality shows that the 
map U thus defined satisfies the master equation (up to a tailor-made Ito’s formula, see section 
I5.4.4j) . Of course, the main issue is to prove that U is sufficiently smooth to perform the above 
computation: for this we need to show that U has a first and second order derivative with 
respect to the measure. As for the deterministic case, this is obtained by linearizing the mean 
field game system (with common noise). This linearization procedure is complicated by the fact 
that the triplet (ut,rht, Mt)t^[o,T] solves an equation in which the coefficients have little time 
regularity. 

As a byproduct of the construction of the master equation, we can come back to the MFG 
system with common noise. Let U be the solution of the master equation (135 j) . 

Corollary 2.12. Given to e [0,T], we call a solution to (|3T]) a triplet (ut,mt,vt)t^[to,T] of 
(Ft)te[to,T]-adapted processes with paths in the space C°([to,T],C^(T'^) x ViT'^) x C^(T^)) such 
that sup^g|-jjj 7 .](||Mt ||2 + Il'Wtlli) e L°°(n,.4.,P) and (ISTTl holds true with probability 1. Under the 
assumptions of Theorem \2.11\. for any initial data (to, mo) e [OjT] x V(T'^), the stochastic 
mean field game system dam has a unique solution (ut,mt,vt)te[o,T]j where (ut,mt)te[o,T] is an 
(Ft)te[o,T]-adapted processes with paths in the spaces ^*^([0,T],C”(T^) x V(T‘^)) and where the 
vector field (vt)te[o,T] is given by 



DmU(t,x,mt,y)dmt(y). 
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2.4.4 The convergence of the Nash system for N players 

We finally study the convergence of Nash equilibria of differential games with N players to the 
limit system given by the master equation. 

We consider the solution of the Nash system: 


j 

+ 2 DpH{xj,D^y’^) • = F{xi, m^’') in [0, T] x T 

iT,x) = Gixi, ’*) in 


Nd 


(36) 


where we have set, for x = (xi,..., xv) £ {T^)^, = tt- ^xj ■ 

N — 1 ~ ^ 

Let us recall that, under the same assumptions on H, F and G as in the statement of 
Theorem 12.Ill the above system has a unique solution (see for instance |46j i. 

Our main result says that the “converges” to the solution of the master equation as 
N —> + 00 . This result, conjectured in Lasry-Lions |50] . is somewhat subtle because in the Nash 
system players observe each other (closed loop form) while in the limit system the players just 
need to observe the theoretical distribution of the population, and not the specific behavior of 
each player. We first study the convergence of the functions and then the convergence of 
the optimal trajectories. 


We have two different ways to express the convergence of the described in the following 
result: 

Theorem 2.13. Let the assumption of Theorem \2.11\ be in force for some n ^ 2 and let (x'^’*) 
be the solution to (l36]l and U be the classical solution to the second order master equation. Fix 
N ^ 1 and {tQ,mo) e [0,T] x V{T'^). 

(i) For any x e (T'^)'^, let := Then 

1 ^ 

_ ^ \v^’\to,x) - U{to,Xi,m^)\ ^ GN-^. 


(a) For any z e {1,... ,N} and x e let us set 


w^'\to,x,mo) 



v^’\to,x)Y[mo{dxj) 

jj^i 


where x = (xi ,..., xv) • 


Then 




CiV-i/'i if d ^ 3 

CiV-i/2 log(iV) if d = 2 


In (i) and (ii), the constant G does not depend on i, to, mo, i nor N. 

Theorem 12.131 savs. in two different ways, that “in average”, the (x'^’*) are close to U. The 
first statement explains that, for a fixed x e (T'^)'^, the quantity |x'^’*(to, x) — U{to,Xi, m^’*)| is, 
in average over i, of order In the second statement, one fixes a measure mo and an index 

i, and one averages in space v^’^{to, •) over mo for all variables but the z—th one. The resulting 
map zx'^’* is at a distance of order of U{to, ^mo). 
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Because of the lack of estimates for the uniform with respect to N, we do not know if 
it is possible to avoid the two averaging procedures in the above results. However, if one knows 
that the solution of the Nash system has a (locally uniform) limit, then this limit is necessarily 
U: 

Corollary 2.14. Under the assumption of Theorem \2.13l let {t,xi,m) e [0,T] x x 'P{T^) be 
fixed and assume that there exists u e M such that 

limsup \v^'^{t,x') — u| = 0. 

A^^+00, 

A x' 

Then, if xi belongs to the support of m, we have v = U{t,xi,m). 


We can also describe the convergence in terms of optimal trajectories. Let to e [0,T), mo £ 
P(T'^) and let (Zi) be an i.i.d family of N random variables of law mo- We set Z = (Zi ,..., Z]\[). 
Let also ((Bt)te[o,T])ie{i,...,N} be a family of N independent Brownian motions which is also 
independent of (Zi) and let (Wt)ts[Q,T] be a Brownian motion independent of the (B*) and (Zf). 
We consider the optimal trajectories {Yt = (ki,t, • • • ,YN,t))te[to,T] for tbe N"—player game: 

f dY,^t = -DpH(Y,,t, Yt))dt + V2dBi + ^dWt, t e [to, T] 

1 ^i,to = Zi 


and the solution (Xt = (Xi^t, ■ ■ ■, of stochastic differential equation of McKean- 

Vlasov type: 

I dXi^t = -DpH (^X,,t,D,U(t,X^,t,C(XifiW))'^ dt + V2dBi + ^dWt, 

1 Xi^tf, = Zi- 


Both system of SDEs are set on Since both are driven by periodic coefficients, solutions 

generate (canonical) flows of probability measures on (T^)-^: The flow of probability measures 
generated in V{(T'^)^) by each solution is independent of the representatives in of the T'^- 
valued random variables Zi,..., Z^. 

The next result says that the solutions of the two systems are close: 


Theorem 2.15. Let the assumption of Theorem \2.1d\ he in force. 
z 6 {1,... , N}, we have 


E 

sup 

Yi,t 

-Xig 


_iG[Zo,T] 


_ 


^ CN d+s 


Then, for any N ^ 1 and any 


for some constant C > 0 independent of to, mo and N. 


In particular, since the (W,t) are independent conditioned on W, the above result is a 
(conditional) propagation of chaos. 

The proofs of Theorem 12.131 and Theorem 12.151 rely on the existence of the solution U of the 
master equation (|3^ and constitute the aim of Section [H Our starting point is that, for any 
N ^ 1, the “projection” of U onto the finite dimensional space [0, T] x (qpd)7V 

is almost a solution 

to the Nash system (f36[) . Namely, if we set, for any z e {1,... , N} and any x = (xi,..., xn) e 

(qpd)7V, 

u^’fit,x) := U{t,Xi,m^’"), 
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then satisfies ([36]) up to an error term of size 0{1/N) for each equation (Proposi¬ 

tion [63I)- Note that, as the number of equations in (f36|) is N, this could yield to a serious issue 
because the error terms could add up. The strategy of proof consists in controlling the error 
terms by exploiting the symmetry of the Nash system along the optimal paths. 

One of the thrust of our approach is that, somehow, the proofs work under the sole as¬ 
sumption that the master equation (13511 admits a classical solution. Here existence of a classical 
solution is guaranteed under the assumption of Theorem 12.111 which includes in particular the 
monotonicity properties of F and G, but the analysis provided in Section [6] shows that mono¬ 
tonicity plays no role in the proofs of Theorems l2.13l and l2.15l Basically, only the global Lipschitz 
properties of H and DpH, together with the various bounds obtained for the solution of the 
master equation and its derivatives, matter. This is a quite remarkable fact, which demonstrates 
the efficiency of our strategy. 
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3 A starter: the first order master equation 


In this section we prove Theoreni l2.8l i.e., we establish the well-posedness of the master equation 
without common noise; 

—dtU{t, X, m) — AxU{t, x,m) + H (x, D^U{t, x, m)) — / divj^ [DmU] {t, x, m, y) dm{y) 

Jjd- 

< + DjnU{t,x,m,y) ■ DpH{y,D^U{t,y,m)) dm{y) = F{x,m) 

in [0, T] X T"* X V(T'^) 

U{T,x,m) = G{x,m) in x P(T'^) 

(37) 

The idea is to represent U by solutions of the MFG system: let us recall that, for any {to, mo) e 
[0,T) X P(T'^), the MFG system is the system of forward-backward equations: 

f —dtu — Au + H{x,Du)=F{x,m{t)) 

< dtm — Am — div{mDpH{x, Du)) = 0 (38) 

[ u{T, x) = G{x, m{T)), m{to, ■) = mo 

As recalled below, under suitable assumptions on the data, there exists a unique solution {u, m) 
to the above system. Our aim is to show that the map U defined by 

U{to,-,mo) := u{to,-) (39) 

is a solution to dST]). 

Throughout this section assumptions (1261) and (|27p are in force. Let us however underline 
that the global Lipschitz continuity of H is not absolutely necessary. We just need to know 
that the solutions of the MFG system are uniformly Lipschitz continuous, independently of the 
initial conditions: sufficient conditions for this can be found in [50] for instance. 

The proof of Theorem 12.81 requires several preliminary steps. We first recall the existence 
of a solution to the MFG system (l38l) (Proposition 13.ip and show that this solution depends 
in a Lipschitz continuous way of the initial measure mo (Proposition 13.21) . Then we show by a 
linearization procedure that the map U defined in (I39p is of class with respect to the measure 
(Proposition 13. 8[ Gorollarv 13.9h . The proof relies on the analysis of a linearized system with a 
specihc structure, for which well-posedness and estimates are given in Lemma 13.41 and Lemma 
We are then ready to prove Theorem 12.81 (subsection 13.5p . We also show, for later use, 
that the hrst order derivative of U is Lipschitz continuous with respect to m fProposition 13. lip . 
We complete the section by explaining how one obtains the solution U as the derivative with 
respect to the measure m of the value function of an optimal control problem set over flows of 
probability measures (Theorem [3T2|). 

Some of the proofs given in this section consist of a sketch only. One of the reason is 
that some of the arguments we use here in order to investigate the MFG system (j38p have been 
already developed in the literature. Another reason is that this section constitutes a starter only, 
specifically devoted to the simpler case without common noise. Arguments will be expanded in 
detail in the two next sections, when handling mean-field games with a common noise, for which 
there are much less available results in the literature. 

3.1 Space regularity of U 

In this part we investigate the space regularity of U with respect to x. Recall that U{to, •,rno) 
is defined by 

U{to,x,mo) = u{to,x) 
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where {u,m) is a classical solution to (|38p with initial condition m(to) = itiq. By a classical 
solution to (l38l) we mean a pair {u,m) e x T],7^(T‘^)) such that the equation for u 

holds in the classical sense while the equation for m holds in the sense of distribution. 

Proposition 3.1. Assume that (HFl(n)) and (HGl(n+2)) hold for some n ^ 0. Then, 
for any initial condition {to, mo) e [0,T] x P(T'^), the MFG system (f88l) has a unique classical 
solution (u, m) and this solution satisfies 


sup 




+ H llD‘u 


1+q/2,2+q; 


^Cn, 


(40) 


where the constant Cn does not depend on {to,mo)- 

If moreover mo is absolutely continuous with a smooth positive density, then m is of class 
^i+a/ 2 , 2 +a q smooth, positive density. 

Note that further regularity of F and G improves the space regularity of u but not its time 
regularity (as the time regularity of the coefficients depends upon that of m, see Proposition 13.II 
right above). By ()40]l . we have, under assumptions (HFl(n)) and (HGl(n+2)) 


sup sup \\U{t,-,m)\\n+ 2 +a ^ Cn- 

ts[0,T] mcV{T‘^) 

Proof. We provide a sketch of proof only. Existence and uniqueness of classical solutions for (I55|l 
under assumptions (HFl(n)) and (HGl(n+2)) for n = 0 are standard: see, e.g., [38lll9]. Note 
that we use here the Lipschitz continuity assumption on H, which guaranties uniform Lipschitz 
estimates on u. 

We obtain further regularity on u by deriving in space n times the equation for u. 

When mo has a smooth density, m satisfies an equation with exponents, so that by 

Schauder theory m is j£ moreover, mo is positive, then m remains positive by strong 

maximum principle. □ 


3.2 Lipschitz continuity of U 

Proposition 3.2. Assume that (HFl(n + 1)) and (HGl(n+2)) hold for some n ^ 0. Let 
mg,mg 6 V{T^), to £ [0, T] and {u^,m^), {u^,m?) he the solutions of the MFG system (f38ll with 
initial condition {to,m}f) and (to,w-o) respectively. Then 

sup \di{m^{t),m‘^{t))+ \\u^{t,-)-u‘^{t,-)\\ } ^ C'ndi(mJ,mg), 

te[0,T] I- lln+Z+a) 

for a constant Cn independent of to, mg and mg. In particular, 

\\U{to,-,ml) - t/(to,-,’7rg)||^_^2+« ^ C'ndi(mg,mg). 

Proof. First step. To simplify the notation, we show the result for tg = 0. We use the well-known 
Lasry-Lions monotonicity argument (see the proof of Theorem 2.4 and Theorem 2.5 of [50]): 

^ [ {u^{t,y) - u‘^{t,y)){m^{t,y) - m‘^{t,y))dy 

at Jjd 

^ [ ]-\Du^{t,y) - Du^{t,y)\'^{m^{t,y) + m?{t,y))dy 

y-j-d z 
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since F is monotone, Du^ and Du^ are uniformly bounded and H satisfies (I26jl . So 


f f \Du^{t,y) - +m^{t,y)) dydt 

Jo JT'i 

{u^{t,y) -u‘^{t,y)){m^{t,y) -m^{t,y))dy 


^ C 


T 

Jo 


At time T we use the monotonicity of G to get 


f {T, y) - {T, y)){m^ (T, y) - (T, y))dy 

= [ {G{y,m\T))-G{y,m^{T))){m\T,y)-m‘^{T,y))dy^0. 

Jfd 

At time 0 we have by the definition of di, 

[ {u^{0,y) -u‘^{0,y)){ml{y) -ml{y))dy ^ C'||D(u^ - u^)(0, •)lloodi(mj, mg). 


Hence 

rT 


[ [ y) + w?{t, y))\Dv}'{t, y) - Du^{t, y)\^dydt ^ C\\D{u^ - u^)(0, •)iloodi(mg, mg). 

Jo Jt'^ 

(41) 

Second step: Next we estimate m^ — m?: to do so, let (H, J”,P) be a standard probability 
space, Ag, Xq be random variables on H with law mg and mg respectively and such that 
E[|Aq — Aq|] = di(mQ,mg). Let also (Ag), (Ag) be the solutions to 

dXi = -DpH{Xi, Du\t, Xi))dt + V2dBt t e [0, T], i = 1, 2, 

where {Bt)te[o,T] is a d—dimensional Brownian motion. Then the law of Xf is m*(t) for any t. 
We have 

E[|Ag-Ag|] ^ E[|Ag-A2|]+E {\DpH{XlDu\s,Xl)) - DpH{XlDu\s,X^^)) 

+ \DpH{xlDu\s,X^))-DpH{xlDu\s,X^,))\ ds 


As the maps x >—>■ DpH{x, Du^{s, x)) and p >—>■ DpH{x,p) are Lipschitz continuous (see (l26l) and 
Proposition 13.21) : 

E[|Ag-Ag|] 

^ E[|Ag — Agl] + C f E[|Ag — Ag|]ds + C f f \Du^{s,x) — Du‘^{s,x)\m‘^{s,x)dxds 
Jo Jo Jjd- 

B r \ 

I / \Du^(s,x) — Du‘^(s,x)\'^m‘^(s,x)dxds] 

0 / 


^di(mg,mg) + C f E[|Ag - Ag|]ds + C 
Jo 

In view of (j41h and Gronwall inequality, we obtain 


E[|Ag - Ag|] ^ G di(mg,mg) + ||L>(u^ - u^)(0, •)ll»^di(mg, mg)^/^ 


(42) 
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(43) 


As di(m^(t),^ IE[|Xj — A||], we get therefore 

sup di(m^(t),m^(t)) ^ C [di(mo, mg) + - m^)( 0, •)llM^di(mo, . 

iG[0,T] L J 

Third step: We now estimate the difference w := v} — v?. We note that w satisfies: 

[ —dtw{t, x) — Aw{t, x) + V{t, x) ■ Dw{t, x) = x) in [0, T] x 
I w{T, x) = Rt{x) in T'^ 

where, for (t, x) 6 [0, T] x T*^, 


V{t,x) 



DpH{x, sDv}{t, x) + (1 — s)Du^{t, x)) ds, 


n 6F 

— {x,sm^{t) + (1 - 3)171^- w?{t,y)) dyds 
■d dm 

and ^ 

^ L Ld + (1 “ s)m‘^{T),y){m^{T,y) - m‘^{T,y)) dyds. 

By assumption (HFl(n + 1)) and inequality ([451) . have, for any t e [0,T], 


D^Ri(t,-) 




n+l+a 


[ Dy^{-,sm^{t) + {1 - s)m‘^{t),-) 

Jq out Qn + l + oc l^OD 

^ c\di{ml,ml) + ||Dt(;(0,-)||*^di(mo,mo)^/^ 


ds di{m^(t), (t)) 


and, in the same way (using assumption (HGl(n+2))), 


\\Rt 


n+2+a 


^ c 


di{ml,ml) + \\Dw{0, ■)\\W^di{ml,ml)^/‘^ 


On another hand, V (t, •) is bounded in in view of the regularity of and (Proposition 

EH). Then Lemma 13.31 below states that 


sup \\w{t,-)\\n+ 2 +a ^ 
ts[o,r] 


< 


C 

C 


|.RT||,^+2+a T sup ||i?i(t, •)||n,+l+a 
te[0,T] 

di(mo,mo) + ||L>r(;(0, •)||^^di(mo, 


Rearranging, we find 

sup \\w{t,-)\\n+ 2 +a ^ C'di(mo,mo), 
te[o,r] 

and coming back to inequality (1451) . we also obtain 

sup di(m^(t),m^(t)) ^ Cdi(mo,mg). 
te[o,T] 


□ 


In the proof we used the following estimate; 
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Lemma 3.3. Letn>l,V e C°([0,T], M"*)) and f e C°([0,r],C’^-i+"(T'^)). Then, 
for any zt e the (backward) equation 

{ —dtz — IS.Z + V{t,x) ■ Dz = f{t,x), in [0, T] X T'^ 

I z{T,x) = zt{x) 

has a unique solution which satisfies 


sup ||2(t,-jlln+a + sup-j- ^ C 

ts[0,T] t/t' \t'-t \2 


\\ZT\\n+a+ sup 

is[0,T] 


||/(^) OIU—1+0 


where C depends on sup^gj-o^T^j ||y(i, •)||n-i+o- 

Proof. Beside the time estimate, Lemma 13.31 is a particular case (in the deterministic setting) 
of Lemma 14.41 So we postpone this part of the proof to section [H 
We now prove the time regularity. By Duhamel formula, we have, 

z{t + h,-) - z{t, •) = {Pr-t-h - PT-t)zT + [ Ps-t-hip{s, ■)ds - 

J tfi-h 



where Pt is the heat semi-group and fi{s, ■) := 14(s,.) • Dz{s, ■) — f{s, ■). Hence, for 2h ^ T — t, 


z{t -I- h, •) 




n-\-a 




rt-{-2h 

Jt-{-h 



rt+2h 

\\{PT-t-h - PT-t)zT\\n+a + 

1 ’')\\n-\-ads 

fT 

Jt 

Ps—t—h'f^i.S-i')'i\n+ads-\- I | 

\{Ps-t-h - Ps-t)'llj{s, •) 

J t+2/i 


(44) 


Recalling the standard estimates \\{PT-t-h — PT-t)zT\\n+a ^ Ch'2\\zT\\n+a, \\Ps-t'>P{s, ■)\\n+a ^ 

1 3 

C{s - t)~2\\'ljj{s)\\n-l+a and \\{Ps-t-h - Ps-t)'tp{s,-^n+a ^ Ch{s - t - h)~ 2 W'ljj^s,-fin-l+a, We 
find the result when 2h ^ T — t. 

When 2h > T — t, there is no need to consider the integral from t + 2h to T m the above 
formula (1441) . and the result follows in the same way. □ 


3.3 Estimates on a linear system 


In the sequel we need to estimate several times solutions of a forward-backward system of linear 
equations. In order to minimize the computation, we collect in this section two different results 
on this system. The first one provides existence of a solution and estimates for smooth data. 
The second one deals with general data. 

We consider systems of the form 

f 6F J 

(z) —dtz — Az + V{t,x)-Dz = - — (x,m(t))(p(t)) + b(t,x) inlto,T]xT“ 

dm 

" (a) dtp — A/9 — div(pl/) — ddwijnTDz -I- c) = 0 in [to, T] x T“ (45) 

5G 

{a) z{T,x) = —(x,m(T))(p(r)) + zxix), p{to) = po in 


where V : [to;^] x ^ is a given vector field, m e C°([0, T],'P(T'^)), T : [0,T] x 
is a continuous map with values into the family of symmetric matrices and where the maps 
b : [to, r] X T'^ —> M, c : [to, T] x T'^ —> and zt : —> M are given. We always assume that 

there is a constant C > 0 such that 


Vt,t' e [to,T], di(m(t),m(t')) ^ C\t - t'\P^, 

V(t, x) 6 [to, T] X C-^Id ^ r(t, x) ^ Cld. 
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Typically, V{t,x) = DpH{x, Du{t,x)), T{t,x) = DppH{x, Du{t,x)) for some solution {u,m) 
of the MFG system (13811 starting from some initial data m{to) = mo- Recall that the derivative 
Du is globally Lipschitz continuous with a constant independent of {to, mo), so that assumption 
(|26]) gives the existence of a constant C for which ()i6]l holds. We note for later use that this 
constant does not depend on (to, mo). 

To simplify the notation, let us set, for n e N, Xn = and let (Xn)' be its dual 

space {{XnY = We first establish the existence of a solution and its smoothness 

for smooth data: 


Lemma 3.4. Assume that b, c, zt and po are smooth, V is of elass (^i+“/2,2+a^ p of class 
and (W'(t))fE[to,T] ® family of densities, which are uniformly bounded above and below 

by positive constants. Suppose furthermore that (HFl(n)) and (HGl(n+2)) hold for some 
n ^ 0. Then system (HSl) has a classical solution {z,p) e £^i+«/2,2+a ^ pi+“/2,2+o^ 

Moreover, the pair {z, p) satisfies the following estimates: 


I \ II I )||Tl + 2 + fl _ ,1 T f A^\ 

sup \\z{t, •) \\n+2+a + SUp-j^ CnM. (47) 

i£[io,r] 


and 


sup ||p(t)ll(x„+i)'+ sup 

te[to,T] 


p{t') - p{t)\\(x„+,y 

\t-t'\^ 


^ CnM, 


(48) 


where the constant Cn depends on n, T, sup^gj-^^j \\Vit, Yllx^+n Ihe constant C in (jlGl) . F and 
G (but not on the smoothness assumption on b, c, zt, Po, V, T and m) and where M is given 
by 

M := \\zt\\x „+2 + IIPoll(x„+i)' + sup (||6(t, Ollx^+i + ||c(t)||(x„)')- (49) 

te[to,T] 


Remark: if mo has a smooth density which is bounded above and below by positive con¬ 
stants and if (u, m) is the solution to (l38ll . then V{t,x) := DpH{x,Du{t,x)) and T{t,x) := 
DppH{x, Du{t,x)) satisfy the conditions of Lemma [3~41 


Proof. Without loss of generality we assume to = 0- We prove the existence of a solution to 
(14^ by Leray-Schauder argument. The proof requires several steps, the key argument being 
precisely the estimates ()17)l and (1481) . 


Step 1: Definition of the map T. Let j3 e (0,1/2) and set X := C^([0,T], (Xn+i)'). For p € X, 
we define T(p) as follows: let z be the solution to 

^ 6F 

—dtz — Az + V(t, x) ■ Dz = -T—(x, m{f)){p{t)) + b in [0, T] x T'^, 

Ar (50) 

z{T) =—{x,m{T)){p{T)) + ZT in 

^ ^TTh 

By our assumptions on the data, z solves a parabolic equation with coefficients, and, by 

Schauder estimates, is therefore bounded in when p is bounded in X. Next we define 

p as the solution to 

[ dtp — Ap — div(pF) — (i\v{mTDz -I- c) = 0 in [0, T] x T'^ 

I /o(0) = Po in T'^. 

Again by Schauder estimates p is bounded in for bounded p. Setting T(/9) := p 

defines the continuous and compact map T ■. X —> A. 
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In the rest of the proof we show that, if p = crT(p) for some (p, cr) e X x [0,1], then p 
satisfies (|48p . This estimate proves that the norm in X of p is bounded independently of a. 
Then we can conclude by Leray-Schauder Theorem the existence of a fixed point for T, which, 
by definition, yields a classical solution to (HSl) . 

From now on we fix (p, cr) e X x [0,1] such that p = ctT(p) and let z be the solution to (IHO]) . 
Note that the pair {z, p) satishes 

—dtz — IS.Z + V{t,x) ■ Dz = f ^—(®)”i(^))(p(^)) + ^j ™ [0)^] ^ 

< dtp — Ap — div(pF) — ad\v{rnTDz + c) = 0 in [0, T] x 

p(0) = fjpo, z{T) = a {^^{x,m{T)){p{T)) + ZT^ in T"*. 

Our goal is to show that (H71) and hold for z and p respectively. Without loss of generality 
we can assume that a is positive, since otherwise p = 0. 


Step 2: Use of the monotonicity condition. We note that 
d 


dt J fd 


z{t, x)p{t, x)dx 


= —a 


I Yd 


6F 


—a 


^^{x,m{t))[p{t)) + b{t) p{t,x)dx 
/ Dz{t,x) ■ \T{t,x)Dz{t,x)] m{t,x)dx — a / Dz{t,x) ■ c{t,x)da 

Jjd Jjd 


Using the monotonicity of F and G and dividing by a, we have: 

/ / T{t,x)Dz{t,x) ■ Dz{t,x)m{t,x)dxdt 

Jo Jjd 

^ + ZT{x)]p{T,x)dx 

+ / z{0,x)p(}{x)dx — / / (b{t,x)p{t,x) + Dz{t,x) ■ c{t,x))dxdt 

Jjd Jq JYd 

^ sup ||p(t)||(X„+i)'(ll^T||x„+i + 1|6||) + sup ||^(t,-)b„+i(||/5oll(X,+i)' + ||c||) 
ts[0,T] 4e[0,T] 

where we have set ||6|| = sup \\b{t, ■)\\x„+i, ||c|| := sup^gj-g^'p] ||c(t)j|(x„)'- Using assumption 

is[0,T] 

on r, we get: 

f f \T{t,x)Dz{t,x)\‘^m{t,x)dx 

■^0 / X (51) 

^C'f sup ||p(t)||(X„+i)'(!|2T!|x„+i + ll^ll) + sup ||2(t)||x„+i(l|po||(x„+i)' + llc||) ). 

Vtsro.rl tefO.Tl / 


Second step: Duality technique. Next we use a duality technique for checking the regularity 
of p. Let r e (0,T], ^ e X„+i and w be the solution to the backward equation 

— dtw — Aw + V (t, x) ■ Dw = 0 in [0, r] x T'^, w{t) = ^ in T'^. (52) 

Lemma 13.31 states that 


sup ||t(;(t, •)||n+i+a + sup 
ts[0,T] t#t' 


\\w{t',-) -w{t,-)\\n+l+a 
\t' - 




n+l+CK? 


(53) 
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where C depends on sup^gj-g^'p] ||y(t, •)||n+a- As 
d 


dt 


/ w{t,x)p{t,x)dx = —a / Dw{t,x) ■ (m{t,x)T{t,x)Dz{t,x) + c{t,x))dx , (54) 


we get (recalling that a e (0,1]) 

/ C{x)piT,x)dx 
JT<i 



0 JT‘^ 


= fj / w(0, x)po(x)dx — (7 

^ II w^(0) II l|po||(x„+i)'+ sup \\Dw{t,-)\\xJc\\ 

te[0,T] 

rT r \ / pT 

\Dw{t, x)\‘^m{t, x)dxdt j 
/o Jt"* / 


Dw{t,x) ■ {rn{t,x)T{t,x)Dz{t,x) + c{t,x))dxdt 




^ C 


0 Jt^. 


|r(t, x)Dz{t, x)\‘^m{t, x)dxdt 


1/2 


II ^ II ^n+1 ll/3o||(x„+i)'+ sup \\Dw{t,-)\\xJc\\ 

te[0,T] 


+ \\Dw\\, 



lo Jt‘^ 

Using (ISTI) and (|53|) we obtain therefore 


|r(t, x)Dz{t, x)f m(t, x)dxdt 


1 / 2 - 


[ C{x)p{T,x)dx ^ C'||^||x „+1 ||po||(x„+i)'+ i|ci| + sup ll/3(i)||[xl+i)'(ll^'rllx!+i + 

L tG[o,r] \ ^ 


JUP] Ollxli (IIPoll+ \\cf^^) 


Taking the supremum over ^ with j|.^||jv„+i ^ 1 and over r e [0,T] yields to 


sup ||p(t)||(x„+i)' ^ C 

iG[0,T] 


l|po||(x„+iy + ll^ll + Ilp(^)ll(4l+i)'(ll^r|l4!+i + ll^ll^^^) 

+ sup N(t,-)ii4!+i(iipo44l+i)' + iicr^^) 

tG[o,r] ^ ^ ' 


Rearranging and using the definition of M in (I57ji . we obtain 


sup ||p(i)||(x„+i)'^ C* 

te[0,T] 


M+ sup ||2(t,-)ll4!4 

ts[0,T] 


1/2 

■„+iVii^^uii(x„+iy 




(55) 


We can use the same kind of argument to obtain the regularity of p with respect to the time 
variable: integrating (I54p in time and using the Holder estimate in (j53h we have, for any r e 
[0,T], 


/ Cix){p{T,x) - p{t,x))dx 

JTd, 


= / (w{t,x) — w{t,x)) p{t,x)dx — a 
Jjd 



—a 


t 

Dw{s, x) ■ c(s, x)dxds 


Dw{s, x) ■ [r(s, x)Dz{s, x)]m(s, x)dxds 


^ f t 

^ C(r-t)4l?||x„+i sup ||p(t)||(x„+i)' 
te[0,T] 

+ (r - tf/‘^\\Dw\\a:, 

lo J^d 

+ (r-t) sup ||iu(t,-)||x„+il|c' 

iG[0,T] 



|r(s, x)Dz{s, x)\ m{s, x)dxds 


1/2 
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Plugging (l55]) into (l5T]i . we get that the root of the left-hand side in (fCT) satisfies the same 
bound as the left-hand side in (1551) . Therefore, 


/ ^{x){p{t,x) - p{t,x))dx 


^ C{t - t)^\\i\\x„+^ 




Dividing by (r — and taking the supremum over ^ yields 


IIKO -pWII(X„+i)' 

sup-f^ ^ C 

t#t' It — t'\ 2 




(56) 


Third step: Estimate of z. In view of the equation satisfied by z, we have, by Lemma 13.31 

\\z{t',-) - z{t,-)\\n+ 2 +a 


sup ||z(f, •)||n+2+a + sup ^ 

ts[ 0 ,r] \f-t \2 

r 6F 

^Ca sup -— [x,m{t)){p{t)) + b{t, ■) 

jg[o,t] bm 


n+l+Q; 


6m 


{x,miT))ipiT)) + ZT 


ri.+2+Q;- 


where C depends on supjg[o,T] ll^(L')lln+i+a- Assumptions (HFl(n-|-l)) and (HGl(n+2)) 
on F and G and the fact that a e [0,1] imply that the right-hand side of the previous inequality 
is bounded above by 


C 


sup ||p(t)ll(x„+i)' + ll^ll + IIp(2")I1(x„+ 2)' + lkT||x „+2 

ts[o,r] 


Estimate (f55]) on p then implies (since \piT)\{Xn+ 2 y ^ ll/°(^)ll(x„+i)')- 

\z{t', •) - Z{t, •)||n+2+« 


sup I|2:(t, •)|U+2+« +sup- 
tG[o,r] 




W -A- 




Rearranging we obtain (H7|) . Plugging this estimate into (15^ and (IS5|) then gives 


□ 


We now discuss the existence and uniqueness of the solution for general data. Given n ^ 2, 
zt e Xn+ 2 , Po e A'(j, be L*([0,T], A„), ce L°°([0,T], [(A„)']'^), we define a solution to to 
be a pair {z,p) e C°([0,T], A „+2 x {^n)') that satisfies (1451) in the sense of distribution. 

Here is our main estimate on system ()45p . 

Lemma 3.5. Let n ^ 0. Assume that (HFl(n-|-l)) and (HGl(n+2)) hold, that V e 
C''^([0,T], A„+i) and that 

M := ||2t||x„+2 + !IPoll(jv,+i)' + sup (||6(L-)l|x„+i + llc(t)||(x„)') < oo- (57) 

te[to,T] 

Then there exists a unique solution {z,p) of (l4^ with initial condition /?(to) = Po- This solution 
satisfies 

sup ||(z(f,-),/?(t))llx„+ 2 x(x„+i)' ^ CM, 
te[to,T] 

where the constant C depends on n, T, sup^gj-Qj^j ll^(t, t/ie constant C in (I46]l . F and 

G. 

Moreover this solution is stable in the following sense: assume that 
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• the data and converge to V, m, T and po respectively in the spaces C°([0, T] x 

CO([to,T],P(T'^)), C\[0,T] x and {Xn+i)', 

• the perturbations (b^), (c^) and converge to b, c and zt, uniformly in time, in Xn+i, 

in \{Xn)'Y Xn +2 respectively. 

Suppose also that the (defined by (1571) for the (6^), (c^), z!f and p^) are bounded above by M+ 
1 and that the sup^gj-g^'r] \\V^{t, ■)\\x„+i are uniformly bounded. Then the corresponding solutions 
{z^,p^) converge to the solution {z,p) of (j45]) in C°([to,7"],x 

Proof. By Lemma 13.41 existence of a solution holds for smooth data. We now address the case 
where the data are not smooth (so that M cannot be zero). We smoothen m, L, b, c, zt and pQ 
into m^, (b^), (c^), z^ and /?q in such a way that the corresponding is bounded by 2M 

and is a smooth density bounded above and below by positive constants. Let {z^,p^) be a 
classical solution to (I45p as given by Lemma 13.41 Using the linearity of the equation, estimates 
(H71) . (H8|) imply that {z^,p^) is a Cauchy sequence in C°([0, T],x C”+"(T‘^))') and 
therefore converges in that space to some limit {z, p). Moreover [z, p) still satisfies the estimates 
([^71) . (|18]) . By (HFl(n+l)) and (HGl(n+2)), (|^(•,m^)(p^)) converges uniformly in time 
to (|^(•’H(P)) while ( converges to {^{•,'m){p)). Therefore {z,p) is a weak 

solution to (1451) . 

Note also that any solution of (I45]l satisfies estimates (1471) . (1481) . so that uniqueness holds by 
linearity of the problem. □ 


3.4 Differentiability of U with respect to the measure 


In this section we show that the map U has a derivative with respect to m. To do so, we 
linearize the MFC system ([38]) . Let us fix (to,mo) e [0,T] x V(T‘^) and let {m,u) be the 
solution to the MFC system (|38p with initial condition m{to) = mo. Recall that, by definition, 
U{to,x,mo) = u{to,x). 

For any po in a suitable space, we consider the solution (u, p) to the linearized system 

( 5F 

—dtv — Av + DpH{x, Du) • Dv = - —(x, m{t)){p{t)) 

dm „ 

< dtp — Ap — dw{pDpH{x, Du)) — div{mDp H{x, Du)Dv) = 0 (58) 

6G 

v{T,x) = —{x,m{T)){p{T)), p{to,-) = po 
Our aim is to prove that U is of class with respect to m with 


v{to,x) 


f ^ 

/xd dm 


{to,x,mo,y)poiy)dy. 


Let us start by showing that the linearized system p58|) has a solution and give estimates on 
this solution. 


Proposition 3.6. Assume that (HFl(n+l)) and (HGl(n+2)) hold for some n ^ 0. 

(i) Let mo be a smooth density bounded below by a positive constant and let po be smooth map 
on T'^. Then there exists a unique solution {v,p) e (ji+“/ 2 , 2 +« ^ (ji+Q:/ 2 , 2 +a system (l58p . 

(a) If mo e P(T'^) and po e (C”'"'"^"''")', there is a unique solution {v,p) of (j58P (in the sense 
given in section Wf^) and this solution satisfies 

®np { II'a(f, •) ||n+2+a T IIM(^) II —(n+l+a)} ^ f7||P-O|| — (n+l+o)) 

te[to,T] 
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where the constant C depends on n, T, H, F and G (but not on {to, mo)). 

(Hi) The solution is stable in the following sense: assume that the triplet (ig,mQ,/iQ) converges 
to {to,mo, Ho) in [0,T] x V{T^) x Then the corresponding solutions {v^,jjf) 

to (f58j) (where {u^,mf) solves ([38]) with initial condition mf{tQ) = m^) converge to the 
solution {v,h) OTCO([to,T],C”+2+“(T‘^) x (C”+i+“(T‘^))'). 

Proof. It is a straightforward application of Lemmata 13.41 and 13.51 respectively, with V{t, x) = 
DpH{x, Du{t,x)), T{t,x) = DppH{x, Du{t,x)) and zt = b = c = 0. Note that V satisfies the 
condition that D^V belongs to T], in view of Proposition 13.11 □ 

Corollary 3.7. Under the assumptions of Proposition there exists, for any {to, mo), a 
Qn+2+a X {x,y) ^ K{to,x,mo,y) such that, for any ho £ 

the V component of the solution of (l58]) is given by 


v{to,x) = (K{to,x,mo,-),Ho)c n +1 + a ( fd ) ^ n +1 + a ( T ) )' • 


(59) 


Moreover 

\\K {to, ', mo, ')\\(^n+2+Q,n+l+a) ^ 

and K and its derivatives in {x,y) are continuous on [0, T] x T'^ x V{T‘^) x 

Proof. For i e with \(.\ ^ n + 1 and y 6 T'^, let {v^^'){■, ■, y), , y)) be the solution to 

(|58p with initial condition hq = D^Sy (the 1'—th derivative of the Dirac mass at y). Note that 
Ho e (C”+^+“(T‘^))'. We set K{to,x,mo,y) := v^^'i{to,x,y). 

Let us check that dy.^K{to,x,mo,y) = —v^^^\to,x,y) where e\ = (1,0,... ,0). Indeed, since 
€~^{dy+eei —^y) converges to in {C^+i+ay by linearity, the map e~^{K{-, ■,mo,y + 

eei)—K{-,-,mo,y)) is the first component of the solution of (l58|) with initial condition e~^{6y_^_^f,^) — 
6y), this map must converge by stability (point (iii) in Proposition 13.6p to the first component 
of the solution with initial condition —D^^5y, which is — ■,y). This proves our claim. 

One can then check in the same way by induction that, for ll'l ^ n + 1, 

D^yK{to,x,mo,y) := {-l)''^''v^^\to,x,y). 

Finally, if \t\ ^ n + 1, point (ii) in Proposition 13.61 combined with the linearity of system (15811 
implies that 


K^^\to, ■,mo,y) - K^^\to, ■,mo,y') 


n+2+a 




< 


C\\5y - Vll-a ^ C\y - y'l" 


Therefore K{to, ■,mo, ■) belongs to x Continuity of K and its derivatives in 

{to, mo) follows from point (iii) in Proposition 13.61 □ 


We now show that K is indeed the derivative of U with respect to m. 

Proposition 3.8. Assume that (HFl(n+l)) and (HGl(n+2)) hold for some n ^ 0. Fix 

to e [0,T], mo,mo e V{T‘^). Let {u,m) and {u,m) be the solution of the MFG system ([38]l 
starting from {to, mo) and {to, mo) respectively and let {v,h) be the solution to (l58]l with initial 
condition {to,mo — mo). Then 

sup {I|h(t, •) -u{t,-) -v{t,-)\\n+ 2 +a + i|m(L') “ m(t, •) - p(L') ll-(n+l+a) } ^ Cdi (mo, mo). 
te[to,T] 
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As a straightforward consequence, we obtain the differentiability of U with respect to the 
measure: 


Corollary 3.9. Under the assumption of Provo sition \3^ the map U is of class (in the sense 
of Definition \2.1]} with 

— {to,x,mo,y) = K{to,x,mo,y), 
whose regularity is given by Corollary \3.7[ Moreover, 


U{to,-,mo) 


f su 

U{to,-,mo)- / —{to,-,mo,y)d{mQ-mo)iy) 
Jjd dm 


^ Cdj{mo,rho). 

ri.+2+a 


Remark 3.10. Let us recall that the derivative 6Uf6m is defined up to an additive constant 
and that our normalization condition is 



,x,mo,y)dmo{y) = 0. 


Let us check that we have indeed 


/ K{t,x,mo,y)dmo{y) = 0. (60) 

JTd, 

For this let us chose yo = mo in (l58|) . Since, by normalization condition, |^(t, m(t))(m(t)) = 0, 
for any t e [0, T], and ^(t, m(T))(m(T)) = 0, it is clear that the solution to is just 
(v,fi) = (0,m). So, by 1(5^ . (fGOll holds. 

Proof of Proposition \3.^ Let us set z ;= u — u — u and p := m — m — fi. The proof consists in 
estimating the pair {z,p), which satishes: 


/" 


< 


6F 

—dtz — Az + DpH{x, Du) • Dz = - — (x, m{t)){p{t)) + b 

dm 

dtp — A/9 — d\Y{pDpH (x, Du)) — diY{mDp H{x, Du)Dz) — div(c) 
6G 

z{T,x) = —(x,m(r))(p(T)) +zt{x), p(to,-) =0, 
dm 


= 0 


where 

with 

and 


b{t,x) = A{t,x) + B{t,x) 

Aft, x) = — f {DpH{x, sDu + (1 — s)Du) — DpH{x, Du)) ■ D(u — u) ds 

Jo 


Bft, x) 




dm 


{x,m{f),y) d{m{f) 


m{t)){y)ds, 


c{t) = {fh — m){t)DppHDu{t, ■)){Du — Du){t, ■) 

+m j (^DppH[-, sDiift, •) + (! — s)Du{t, •)) — DppH[-, Du{t, -j))^ {Du — Du){t, ■)ds 

(note that c{t) is a signed measure) and 

^t{x) = j ^MT) + (1 - s)m{T),y) - ^{x, m(T), y)^ d{m{T) - m{T)){y)ds. 
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We apply Lemma [3]5] to get (recalling the notation Xn = 


sup \\{z{t),p{t))\\ 
tG[to,T] 


\\zT\\xr ,+2 + IIPoll(x„+i)' + sup (||6(t)||x„+i + ||c(t)||(x„)') 

te[to,T] 


It remains to estimate the various quantities in the right-hand side. We have 

sup \\b{t)\\xr,+i ^ sup ||^||x„+i + sup ||B||x„+i, 
ts[0,r] te[0,T] te[0,r] 


where, 

sup ||^||x„+i ^ c* sup \\u-u\\x„ ^^Cdl{mo,mo) 
ts[0,T] ts[o,r] 

according to Proposition 13.21 To estimate B and ||.zt||x„+ 2 ) we argue in the same way: 


\\zt\\x„+2+ sup || 5 ||x„+i ^ C'di(mo,mo), 
ts[0,T] 

where we have used as above Proposition 13.21 now combined with assumptions (HFl(n-|-l)) 
and (HGl(n-l-2)) which imply (e.g., for F) that, for any mi,m 2 e 

T^(-, rni,y) - |^(•, m2, y) ) d{mi - m2){y) 

Ijd y5m 5m ) 

6F 6F 

^di(mi,m2) Dy—{-,mi,-)-Dy — {-,m2,-) ^ C'di(mi,m2). 


'5m 


Finally, 


sup ||c(t)|l(x„)' ^ sup sup <^,c(t)>x„,(X„)', 

ie[0.T] ts[0,T] ||5||x„^l 


where, for ||^||x„ ^ 1, 

(^xit))xn,{Xr,y 

(m — m)DppH{■,Du{t, •)) {Du — Du){t, •) 


Ifd 


-I- rh{t) f {Dp H (•, [sDu -f (1 — s)Du]{t, •)) — DpH (•, Du{t, •))) {Du — Du){t, ■)ds 
Jo 

^ C'^ll^llillu - -u||2di(mo,mo) + ll^llcoll'U - "“Hi)- 


W,(W)' 


So again by Proposition 13.21 we get 


sup ||c(t)||(x„)' ^ Cdl{mo,mo). 

te[0,T] 


This completes the proof. 


□ 


3.5 Proof of Theorem 12.81 

Proof of Theorem \2.8\ (existence). We check in a first step that the map U defined by (l3^ is a 
solution of the first order master equation. Let us first assume that mo £ C^(T'^), with mo > 0. 
Let to > 0, {u,m) be the solution of the MFG system ([38]) starting from mo at time to- Then 

U{tQ + h,x,mo) — U{to,x,mo) _ U{to + h,x,mo) — U{to + h,x,m{to + h)) 
h “ h 

17(to -I- h, X, m{to + h)) — U (to, x, mo) 

^ h ■ 
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Let us set = (1 — s)m(to) + sm{to + h). Note that, by the equation satished by m and the 
regularity of U given by Corollary 13.91 


U (to + h, X, m(to + h)) — U (to + h, x, m(to)) 

n 5U 

— (to + h, X, ms,y){m{to + h,y) - m(to, y)) dyds 
■d dm 

SU / \ 

— (to + h,x,ms,y)yAm{t,y) + div[m{t,y)DpH{y, Du{t,y)))j dtdyds 


0 Jt‘^ Jto 

rl r rto + h 

/ / / Ay — {to + h,x,ms,y)m{t,y)dtdyds 

lo JTdJto dm 

-l r rto+h 


fto+n ^jj 

/ Dy—{to + h, X, ms,y) ■ DpH[y, Du{t, y)) m{t, y) dtdyds. 

0 Jt‘^ Jto 

We can then divide by h to obtain, using the continuity of DmU and its smoothness with respect 
to the space variables: 


lim 

h—>0 


[/(to + h,x,m{to + h)) — [/(to + h,x,mo) 
h 


Ijd 


dwy[D,nU]{to,x,mo,y) - DmU{to,x,mo,y) ■ DpH(jj,Du{to,y))'^ mo{y) dy. 


On the other hand, for h > 0, 

U (to + h, X, m{to + h)) — U{to,x, mo) = u(to + h,x) — u{to, x) = hdtu{tQ, x) + o{h), 


since u is smooth, so that 


[/(to +/i, x,m(to + h)) — [/(to,x,mo) 

hm - - - 

h^o+ h 


dtu{to,x). 


Therefore dtU{to, x,mQ) exists and, using the equation satisfied by u, is equal to 


dtU{to,x,mo) = - divy[DmU] {to,x,mo,y)mo{y)dy 

+ / DmU{to,x,mo,y) ■ DpH{x,Dj;U{to,y,mo))mo{y)dy 
Jjd- 

- A^U{to,x,mo) + H{x,D^U{to,x,mo)) - F{x,mo). 


(61) 


This means that U has a continuous time derivative at any point (to,x,mo) where mo e C^(T'^) 
with mo > 0 and satishes (l28l) at such a point. By continuity of the right-hand side of (IGT)) . U 
has a time derivative everywhere and (|28l) holds at any point. □ 


Next we turn to the uniqueness part of the Theorem: 

Proof of Theorem \2.8\ (uniqueness). In order to prove the uniqueness of the solution for the 
master equation, we explicitly show that the solutions of the MFC system (j38h coincide with 
the characteristics of the master equation. Let V be another solution to the master equation. 
The main point is that, by the definition of a solution, D‘^ y^ is bounded, and therefore D^V 
is Lipschitz continuous with respect to the measure variable. 
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Let us fix (to, itiq)- In view of the Lipschitz continuity of DxV, one can easily uniquely solve 
in T],7^(T‘^)) the Fokker-Planck equation: 

[ dtrh — Am — diY[rhDpH{x,DxV{t,x,'m))) = 0 in[to,r]xT‘^ 

I m(to) = mo in 


Then let us set u{t,x) = V{t,x,m{t)). By the regularity properties of V, u is at least of class 
with 


dtu{t,x) = dtV{t,x,m{t)) + (—{t,x,rh{t), ■), dtm{t ))^2 

(5V^ 

= dtV{t,x,rh{t)) + (^—{t,x,rh{t), •), Am + div{mDpH[x,DxV{t,x,m ))')^2 (^c 2 Y 
= dtV{t, X, m{t)) + / divj^ [DmV] {t, x, m{t),y) dm{t){y) 


- / DmVit,x,m{t),y) ■ DpH{y,DxV{t,y,m)) dm{t){y). 


Ijd 


Recalling that V satisfies the master equation, we obtain: 

dtu{t, x) = —AxV{t, X, m{t)) + H (x, DxV{t, x, m(t))) — F(x, m(t)) 

= —Au(t, x) + ff(x, Du{t, x)) — F{x, m{t)) 

with terminal condition u(T,x) = V{T,x,m(T)) = G{x,m(T)). Therefore the pair {u,rh) is 
a solution of the MFG system (1381) . As the solution of this system is unique, we get that 
V{to,x,mo) = U{to,x,mo). □ 


3.6 Lipschitz continuity of with respect to m 

We later need the Lipschitz continuity of the derivative of U with respect to the measure. 

Proposition 3.11. Let us assume that (HFl(n+l)) and (HGl(n+2)) hold for some n ^ 2. 
Then 


sup sup (di(mi,m 2 )) ^ 

ts[0,T] mi^m 2 


5U, 
dm 


5U, 

7—(L-,ni2,-) 
dm 


(n+2+Q:,n+Q:) 


< 00 , 


where C depends on n, F, G, H and T. 

Proof. By continuity of in the measure argument (see Corollaries 13.91 and 13.7|] . we can assume 
without loss of generality that mj and mg are two smooth, positive densities. Let yo ^ C®(T'^). 
We consider (u^,m^) and (u^, mf) the classical solutions to the MFG system (f38|l associated with 
the initial condition (to,mQ) and (to,?^o) respectively and {v^,p}) and the associated 

classical solutions to (l58]l with /i^(to, •) = /^^(^o, •) = To- 

Let us set {z,p) := {v^ — — pP). We first write an equation for {z,p). To avoid too 

heavy notation, we set iL((t, x) = DpH{x, Du^{t,x)), H'{{t,x) = DppH{x, Du^{t,x)), F{{x, p) = 
/t^ etc... Then {z,p) satisfies 


r —dtz — Az + H[Dz = F[{-,p) + b 
< dtp — Ap — diY{pH[) — diY{m}HfDz) — div(c) = 0 
[ z{T) = G^(p(T)) + ZT, m{to) = 0 
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where 


b{t,x) := F{{x,fj.^{t)) - F 2 {x,n‘^{t)) - [{H[ - H 2 )Dv^]{t,x), 
c{t, x) := x){H[ — H 2 ){t, x) + [{rri^H" — m?H 2 )Dv^^ (t, x), 

zt{x) ■.= G'^{^?{T))-G'2{^x\T)). 


We apply Lemma 13.51 with V = H[. Recalling the notation it says that, under 

assumptions (HFl(n+l)) and (HGl(n+2)), 

sup \\z{t,-)\\x „+2 ^ c \\zt\\x„+ 2 + sup (|| 6 (t,-)||x„+i + i|c(t,-)!l(x„)') 

te[to,T] 4s[0,T] 

Let US estimate the various terms in the right-hand side: 

\\zt\\x„+2 ^ “ ^{Or,m‘^{T),y)jn^{T,y)dy 


n+2+a 




— (0, •,m^(r), •) - 

dm dm 


(n+2+Q,n+H-Q;) 


\\y-‘^{T)\\-{n+l+a) 


^ Cdi{ml,ml) ||/iol|-(n+i+a) 

where we have used Proposition I3.61 (iil in the last inequality. Moreover, we have 

where the first term can be estimated as zt- 


The second one is bounded by 

\\{H[ - H'2){t, ■)Dv\t, = \\{DpH{; Du\t, •)) - DpHi;Du\t, ■)))Dv\t, 

^ ||(u^ - •)l|n+2+a!l^^^(L •)lU+2+a 

^Gdi{ml,ml) \\no\\-{n+i+a), 

where the last inequality comes from Proposition 13.21 and Proposition 13.61 thanks to assumptions 
(HFl(n-|-l)) and (HGl(n-|-2)). Finally, by a similar argument, 


— {n+a) 


sup 

'Iln + Q!^;! ' 


f (j){x) iH[ — H' 2 ) + {{m} — rn^)H” + rn^{H'l — H 2 )Dv^) {t,x)dx 
Jjd L J 


^ sup H! 2 ){t,-)\\^^Jy‘^{t,-)\\-(n+a) 

ll^lln + Q'^l 

-t di(m^(t),m^(t)) sup \\(j){H'lDv‘^){t,-)\\ + sup \\(j){H'( - H 2 ){t, 
110111^1 110110^1 

^ C||(u^ - u‘^){t, •)|L+„il^olt-(n+«) 

-t Cdi(m^(t),m2(t))||u^(t, Ob + G\\{u^ - u^){t, Ob 

^ C'di(mbmg)|bo|| — (n+a)- 


This shows that 

sup Ib(t, •)|b+ 2 +a ^ Cdi{ml,ml)\\yo\\_(^n+a)- 

te[to,T] 
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As 


we have proved 


z{to,x) = (^^{to,x,ml,y) - ^{to,x,ml,y)^ My)dy, 


sup (di(mi,m 2 )) 

mi^m2 


-1 


6U , , dU, 

— {tor,mi,-) - j^{to,-,m 2 ,-) 


(n+2+Q:,n+a) 




where the supremum is taken over smooth densities. The map being continuous, we can 
remove the restriction of the measures mi and m 2 by approximation to get the full result. □ 

3.7 Link with the optimal control of Fokker-Planck equation 

We now explain that, when F and G derive from potentials functions F and Q, the space 
derivative D^U is nothing but the derivative with respect to the measure of the solution lA oi a. 
Hamilton-Jacobi equation stated in the space of measures. The fact that the mean field game 
system can be viewed as a necessary condition for an optimal transport of the Kolmogorov 
equation goes back to Lasry and Lions [50]. As explained by Lions |52|, one can also write 
the value function of this optimal control problem, which turns out to be a Hamilton-Jacobi 
equation in the space of measure. The (directional) derivative with respect to the measure of the 
value function is then (at least formally) the solution of the master equation. This is rigorously 
derived, for short horizon and first order (in space and measure) master equation by Gangbo 
and Swiech [28] . We show here that this holds true for the master equation without common 
noise. 

Let us assume that F and G derive from potential maps F : P(T'^) —> M and Q : 'P(T'^) ^ 


N dF, 

F{x,m) = —(x,m), 


G{x,m) = -- — {x,mj. 
dm 


(62) 


Note for later use that the monotonicity of F and G implies the convexity of F and Q. 


Theorem 3.12. Under the assumptions of Theorem \2.8\. let U be the solution to the master 
equation (l37l) and suppose that (l62]l holds. Then the Hamilton-Jacobi-Bellmann equation 

-dtU{t,m)I H{y,DmlA{t,m,y))dm{y)- I div [DmlA] {t,m,y)dm{y) = F{m) 

in [0,r] X P(T'^), 
in P(T'='), 


U{T, m) = Q{m) 
has a unique classieal solution U and 

DmU{t,x,m) = DxU{t, x,m) 


V(t,x,m) e [0,r] X T'^ X V{T'^ 


(64) 


We represent the solution U of (I63p as the value function of an optimal control problem: for 
an initial condition {to,mo) e [0,T] x V{T^), let 


U{to, mo) := inf 


Jto LJt'^ 


H* {x, a{t, x)) m{t, dx) 


dt 


'to 


F {m{t)) dt + Q [m{T)) (65) 


(where H* is the convex conjugate of H with respect to the second variable) under the constraint 
that m 6 C*^([0, T],P(T'^)), a is a bounded and Borel measurable function from [0,T] x into 
and the pair {m, a) satisfies in the sense of distribution: 


dtm — Am — div(am) = 0 in [0, T] x m{to) = mo in T'^. 


( 66 ) 
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Of course, (j66|) is understood as the Fokker-Planck equation describing the flow of measures 
generated on the torus by the SDE 


dZt = —a{t, Zt)dt + dBt, t e [0, T], 


which is is known to be uniquely solvable in the weak sense. Notice that, throughout the 
subsection, we shall use, as in ([65]) . the notation m{t,dx) to denote the integral on the torus 
with respect to the (time-dependent) measure m{t). 

The following characterization of the optimal path for U is due to Lasry and Lions [50]: 

Proposition 3.13. For an initial position [to, mo) e [0, T] x let {u,m) be the solution 

of the MFC system (l38l) . Then {m,a) = {m, DpF[{-, Du{-, ■))) is a minimizer for U{tQ,mQ). 

Proof. For a function m e C*^([0,T],P(T'^)) and a bounded and measurable function a from 
[0, T] X T'^ into M'^, we let 


J(m, a) 

where m solves 



X, a{t, x))dm{t) 



F{fh{t))dt + Q{m{T)) 


dtm — Am — div(dm) = 0 in [0, T] x m(to) = uiq in 

As, for any m' e P(T‘^), a' e 

H*{x, a) = sup (o' • p — F[{x,p)) , 

psR'* 

we have, by convexity of F and Q, 

J{m, a) 

/ / a{t,x) ■ Du{t,x) — F[(x,Du{t,x)) rh{t,dx) dt 

'to -I 

rT 


> 


Jto 

= J{m, a) 

rTr r 


F[m{t)) + F[-,m{t)) [m{t) — m{t)) dt -I- Q[m{T)) + G[-,m{T))[rh{T) — m{T)) 


Ito lJT<t 
f-T 


Du{t, x) ■ [a{t, x)m{t, dx) — a{t, x)m{t, dx)) — H (x, Du{t, x)) {jh — m) (t, dx) 
+ I F(^-,m{t)){m — m){t)dt + G[-,m{T)) (rh{T) — m{T)). 


dt 


'to 

because 

a{t, x) ■ Du{t, x) — H (x, Du{t, x)) = H* (x, a{t, x)). 
Using the equation satisfied by (m, w) and {m, w) we have 
'■Tr f 

I Du{t,x) ■ (^a{t,x)m{t,dx) — a{t,x)m{t,dx)) dt 


to Ut<' 


-iT 


Ijd 


u{t, x)(m — m){t, dx) 


+ / {dtu + Au){t,x)(m — m){t,dx) 

Jo Jo Urd 


dt 


= -G{-,m{T)) (m(r) - m(r)) 


(^F[ (x, Du{t, x)) — F(x, (m — m){t, dx) 


h UTd 

This proves that J{m,a) ^ J{m,a) and shows the optimality of (m,a). 


dt. 

□ 
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Proof of Theorem, \S.l^ First step. Let us first check that U, defined by (f65]l . is with respect 
to m and satisfies 

A7 y c 

- — {t,x,m) = U{t,x,m) — / U{t,y,m)dm{y) V(t, x, m) e [0, T] x T'^ x P(T'^). (67) 

dm Jjd 

Assume for a while that (j67[) holds true. Then, taking the derivative with respect to x on both 
sides shows (IMl) . 

We now prove (f67|) . Let mg, mg be two initial measures, {u,m) and {u,m) be the solutions 
of the MFG system ([38]) with initial conditions (to^w-o) and [to, mo) respectively. Let also (u,/i) 
be the solution of the linearized system (f58]l with initial condition (to,hio — m-o). Let us recall 
that, according to Proposition 13.81 we have 

sup [\\u - u - v\\n+ 2 +a + \\m - m - y\\_(n+i+a)} ^ ?«o) (68) 

te[to,T] 

while Proposition 13.21 and Proposition 13.61 imply that 

sup {l|n - u||n+2+a + ||Ai||-(n+l+a)} ^ Cdi(mo,mo)• 

te[0,T] 


Our aim is to show that 


U{tQ,mQ) -U{tQ,mQ) - / [/(to,a;,m.o)d(m,o - mo)(x) = O(di(mo,mo))- 


(69) 


Indeed, if ([691) holds true, then [/ is a derivative of U and, by convention (fMll . proves ([67)1 . 

Second step. We now turn to the proof of (f69]l . Since {u,m) and {u,rh) are optimal in 
U{tQ,mQ) and U{tQ,mQ) respectively, we have 


U{to,rho) -U{to,mo) 

fT 

H* 


(^x, DpH{x, Du{t,x)))rh{t,dx) — / H*(^x,DpH{x,Du{t,x)))m{t,dx)jdt 
+ [^F{m{t)) - F{m{t))yt + g{m{T)) -g{m{T)). 


'to V-lT'i 

rT 


Note that, by 

rT 

'to 


(^j H* (^x,DpH(^x,Du{t,x))'jm{t,dx) — J H* (^x,DpH(^x,Du{t,x))'jm{t,dx)^ dt 
H* (^x, DpH(^x, Du{t,x))'^ y,{t,dx) 

X, DpH(^x, Du{t,x))'j ■ [DppH(^x,Du{t,x))Dv{t,x)] m{t, dx)^ dt + O(d^(mo, uio)) 
(^Du{t, x) • DpH (x, Du{t, x)) — H (x, Du{t, x))^/i(t, dx) 


Ijd 

rT 


DqH* 


Jto 

+ / Du{t,x) ■\D‘^H{x,Du{t,x))Dv{t,x)^m{t,dx)\dt + 0(d\{mQ,rhQ)), 
JT<t ' 
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where we have used the properties of the Fenchel conjugate in the last equality, while 

l-T r 


[ \T{rh{t)) - T{m{t))]dt + g{m{T)) - g{m{T)) 

Jta L -I 

F(^x, m{t)) fj,{t, dx))dt + / G(x, m(T))/r(T, dx) + O(d^(mo, rh-o)) • 


'to 

Jto \JT<t 

Recalling the equation satisfied by u and /r, we have 
d 


/ u{t,x)^{t,dx) 

JTd- 

/ F[{x,Du{t,x)') — F{x,m{t)) fj,{t,dx) — / Du{t,x) ■ DpH(^x,Du{t,x))fi{t,dx) 

Jjd L J Jjd 


— / Du{t,x) ■ DppF[(^x, Du{t,x))Dv{t,x) m{t,dx). 

J-^d L J 

Putting the last three identities together, we obtain 

l({to,mo) -U{to,mo) 

-LdL u{t, x)^{t, dx)^ dt + L G[x,m{T)) fi{T,dx) + O(d^(mo,mo)) 

= / u{to,x)n{to,dx) + O{dl{mo,mo)) = / U{to,x,mo)d{mo - mo){x) + O{dl{mo,mo)). 
This completes the proof of ([67)1 . 

Third step. Next we show that W is a classical solution to the Hamilton-Jacobi equation (|63p . 
Let us fix {to,mo) e [0,T) x P(T'^), where mo has a smooth, positive density. Let also {u,m) be 
the solution of the MFG system (|38]l with initial condition (to, mo). Proposition 13. 131 states that 
(m, DpH{-, Du{-, •))) is a minimizer for U{to, mo). By standard dynamic programming principle, 
we have therefore, for any he (0,T — to), 


rtQ~\-h 


IT {to, mo) = J J H* (^x,DpF[{x,Du{t,x))^m{t,x)dxdt 


Ijd 

rto-\-h 


rto+n 

/ F{m{t))dt+ L{{to + h,m{to + h)). 
Jto 


(70) 


Now we note that 

U{to + h,mo) - U{to,mo) 


h 


U{to + h, mo) —hl{to + h, m{to + h)) U{to + h, m{to -I- h)) —U{to, mo) 


(71) 


h h 

We can handle the first term in the right-hand side of (|71l) by using the fact that U is with 
respect to m. Letting mg^h ■= (1 ~ s)mo + sm{to + h)), we have: 

U{to + h, m{to + h)) —U{to + h, mo) 

n blA 

— (fo + h,mg^h,y)d{m{to + h) - mo){y)ds 
•d om 

-[LC DmU{to + h,ms,h,y) ■ {pm{t,y) + DpH{y,Du{t,y))m{t,y)'^ dtdyds. 
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Dividing by h, letting /i —> 0'^ and rearranging gives 
lim ^—^{to + h, mo) 

/i^0+ /i 


Td 


div [DmU] (to, rno,y)dmo{y) 


Dmld{to,mo,y) ■ DpH[y, Du{to,y)) dmo{y). 


To handle the second term in the right-hand side of (I7T]1 . we use (f7n|) and get 


lim 

h^0+ 


U (to + h, m(to + h)) — (/{{to, mo) 
h 


/ H* (^x, DpH{x, Du{to,x)))dmo{x) — J^{mo). 


As Du{to,x) = DxU{to,x,mo) = DmU{to,mo,x), we have 


- H''{x,DpH[x,Du{t,x))) - Dml{{to,mo,x) ■ DpH[y, Du{to,y)) 

= -H*[x,DpH[x,DmU{to,mo,x))) + DynU{tQ,mQ,x) • DpH[x, Dml{{to,mo, x)) 
= H [x, DmU (to, mo, x)). 


Collecting the above equalities, we obtain therefore 


U{to + h,mo) - U{to,mo) 
h^o+ h 

= -/ div [DmU] {to, mo, y)dmo{y) + H{x, DmU{to,mo,x))dmo{x) - D{mo). 

JTd, Jjd 

As the right-hand side of the above equality is continuous in all variables, this shows that U is 
continuously derivable with respect to t and satisfies ([63]). 

Last step. We finally check that U is the unique classical solution to (1631) . For this we use 
the standard comparison argument. Let V be another classical solution and assume that V ¥^U. 
To fix the ideas, let us suppose that sup(V — U) is positive. Then, for any e > 0 small enough, 

sup V(f, m) — Zi(f, m)-I-elog(;^) 

(i,a;)G(0,T]x7’(T<*) ^ 

is positive. Let (f, m) be a maximum point. Note that t < T because V(T, •) = U{T,-). By 
optimality of {t, rh) and regularity of V and U, we have: 

dtV{i,m) — dtU{i,rh) + % = d and -^(f, m, •) =-^(f, m, •), 

t dm dm 

so that 


DmV{i,m,-) = DmU{i,m,-) and div [D^V] (t, m, •) = div [DmU] {i,m,-). 
Using the equation satisfied by U and V yields to | = 0, a contradiction. 


□ 
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4 MFG system with a common noise 


The main purpose of the two next sections is to show that the same approach as the one developed 
in the previous section may be implemented in the case when the whole system is forced by a 
so-called ‘common noise’. Such a common noise is sometimes referred to as a ’systemic noise’, 
see for instance Lions’ lectures at the College de France. 

Thinking of a game with a finite number of players, the common noise describes some noise 
that affects all the players in the same way, so that the dynamics of one given particle read^ 

dXt = -DpHiXt, Dut{Xt))dt + V2dBt + t e [0, T], (72) 

where /3 is a nonnegative parameter, B and W are two independent d-dimensional Wiener pro¬ 
cesses, B standing for the same idiosyncratic noise as in the previous section and W now standing 
for the so-called common noise. Throughout the section, we use the standard convention from 
the theory of stochastic processes that consists in indicating the time parameter as an index in 
random functions. 

As we shall see next, the effect of the common noise is to randomize the MFG equilibria so 
that, with the same notations as above, becomes a random flow of measures. Precisely, 

it reads as the flow of conditional marginal measures of (Ait)jg[o,r] given the realization of W. In 
order to distinguish things properly, we shall refer the situation discussed in the previous section 
to as the ‘deterministic’ or ‘first-order’ case. In this way, we point out that, without common 
noise, equilibria are completely deterministic. Compared to the notation of the introduction or 
of section [21 we let the level of common noise j3 be equal to 1 throughout the section: this is 
without loss of generality and simplifies (a little) the notation. 

This section is specifically devoted to the analysis of the MFG system in the presence of 
the common noise (see dSl)). Using a continuation like argument (instead of the classical strat¬ 
egy based on the Schauder fixed point theorem), we investigate existence and uniqueness of a 
solution. On the model of the first-order case, we also investigate the linearized system. The 
derivation of the master equation is deferred to the next section. The use of the continuation 
method in the analysis of MFG systems is a new point, which is directly inspired from the 
analysis of finite dimensional forward-backward systems: Its application is here made possible 
thanks to the monotonicity assumption required on F and G. 

As already mentioned, we assume without loss of generality that (3 = 1 throughout this 
section. 

4.1 Stochastic Fokker-Planck/Hamilton-Jacobi-Bellman System 

The major difficulty for handling MFG with a common noise is that the system made of the 
Fokker-Planck and Hamilton-Jacobi-Bellman equations in (j.SSp becomes stochastic. Its general 
form has been already discussed in m- Both the forward and the backward equations become 
stochastic as both the equilibrium (mt)o<ji<jr and the value function depend upon the 

realization of the common noise W. Unfortunately, the stochastic system does not consist of a 
simple randomization of the coefficients: In order to ensure that the value function ut at time t 
depends upon the past before t in the realization of (HA)oscs!£T) the backward equation incor¬ 
porates an additional correction term which is reminiscent of the theory of finite-dimensional 
backward stochastic differential equations. 

^Equation (ESI) is set on R'* but the solution may be canonically mapped onto T"* since the coefficients are 
periodic: When the process {Xt)te[o,T] is initialized with a probability measure on T"*, the dynamics on the torns 
are independent of the representative in R‘* of the initial condition. 
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The Fokker-Planck equation satisfied by {rnt)te[o^T] reads 

dtmt = [2Ami + d\Y{mtDpH{mt,Dut))^dt — ^/2d\\{mtdWt) ■, t e [0,r]. (73) 

The value function u is sought as the solution of the stochastic HJB equation: 

dtut = {-2Aut + H{x,Dut) - F{x,mt) - V2div{vt)}dt + vt ■ dWt, (74) 

where, at any time t, vt is a random function of x with values in Once again, we emphasize 
that the term vt ■ dWt = Xj^=i permits to guarantee that (ui)os;ts;T is adapted with 

respect to the filtration generated by the common noise. The extra term may be 

explained by the so-called Ito-Wentzell formula, which is the chain rule for random fields applied 
to random processes, see for instance [11 . It permits to cancel out the bracket that arises in the 
application of the Ito-Wentzell formulro to {ut{Xt))te[Q^T]: with {Xt)o^t^T as in ([72]). Indeed, 
when expanding the infinitesimal variation of (ut(Xt))jg[o,r]) the martingale term contained 
in ut conspires with the martingale term contained in X and generates an additional bracket 
term. This additional bracket term is precisely \f2diY{vt){Xt)] it thus cancels out with the term 
—^/2div{vt){Xt) that appears in the dynamics of Ut- For the sake of completeness, we provide 
a rough version of the computations that enter the definition of this additional bracket. When 
expanding the difference Ut+dt{Xt+dt) — ut{Xt), for t e [0,T] and an infinitesimal variation dt, 
the martingale structure in (ITU) induces a term of the form vt{Xt+dt)iW't+dt — Wt). By standard 
Ito’s formula, it looks like 


vt{Xt+dt){Wt+dt-Wt) 

d 


= Y,vliX,+dt){W}^dt- 


Wl) = 2 vl[Xt)dWl + V2 2 j^{Xt)dt, 


1=1 


2=1 


dxi 


(75) 


the last term matching precisely the divergence term (up to the sign) that appears in (1741) . 

As in the deterministic case, our aim is to define U by means of the same formula as in 
(|39l) . that is C/(0,x,mo) is the value at point x of the value function taken at time 0 when the 
population is initialized with the distribution tuq. 

In order to proceed, the idea is to reduce the equations by taking advantage of the additive 
structure of the common noise. The point is to make the (formal) change of variable 

ut{x) = ut{x + ^/2Wt)mt{x) = mt{x + ^/2Wt), x e te[0,r]. 


The second definition makes sense when mt is a density, which is the case in the analysis 
because of the smoothing effect of the noise. A more rigorous way to define mt is to let it be 
the push-forward of mt by the shift 9 x i—> x — ^/2Wt £ T^. Pay attention that such a 
definition is completely licit as mt reads as a conditional measure given the common noise. As 
the conditioning consists in freezing the common noise, the shift x x — \f2Wt may be seen as 
a ‘deterministic’ mapping. 

The main feature is that rht is the conditional law of the process {Xt — V2Wt)te[o,T] given 
the common noise. Since 


d{Xt-V2Wt) = -DpH{Xt-V2Wt + V2Wt,DutiXt-V2Wt + V2Wt))dt + V2dBt, te[0,T]. 

®In the application of Ito-Wentzell formula, ut is seen as a (random) periodic function from to R. 
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we get that {mt)te[o,T] should satisfy 

dtifit = {Arht + div(^rhtDpH{- + V2Wt, Dut))] dt 
= {Amt + div(jhtDpHt{-,Dut))}dt, 


(76) 


where we have denoted Ht{x,p) = H{x + ^/2Wt,p). This reads as the standard Fokker-Planck 
equation but in a random medium. Such a computation may be recovered by applying the Ito- 
Wentzell formula to {mt{x + ■\/^Wt))te[o^T]^ provided that each mt be smooth enough in space. 
Quite remarkably, (hT.)jg[o,T] is of absolutely continuous variation in time, which has a clear 
meaning when {'nit)te[o,T] is seen as a process with values in a set of smooth functions; when 
{rht)te[o,T] is seen as a process with values in V{T'^), the process ((y?,mi))te[o,T] ((t) standing 
for the duality bracket) is indeed of absolutely continuous variation. 

Similarly, we can apply (at least formally) Ito-Wentzell formula to {ut{x + V^Wt))te[o,T] iir 
order to express the dynamics of {ut)te[o,T]- 

dtut = {-Aut + H{-+ V2Wt,Dut) - F{-+ V2Wt,mt)}dt + vtdWt, 

~ . ( 77 ) 

= {-Aut + Ht{-,Dut) - Ft{-,mt)}dt + vtdWt, t 6 [0,r], 


where Ft{x,m) = F{x + for a new representation term Vt{x) = Vt{x + ^/2Wt), the 

boundary condition writing ut{') = G{-,mT) with G{x,m) = G{x + \^WT,m). In such a way, 
we completely avoid any discussion about the smoothness of v. Pay attention that there is no 
way to get rid of the stochastic integral as it permits to ensure that ut remains adapted with 
respect to the observation up until time t. 

Below, we shall investigate the system ([76]) - (l77t) directly. It is only in the next section, see 
Subsection 15.51 that we make the connection with the original formulation (j73h - (j74p and then 
complete the proof of Corollary 12.121 The reason is that it suffices to define the solution of 
the master equation by letting 17(0, x, mo) be the value of uq{x) with mo as initial distribution. 
Notice indeed that 'Uo(x) is expected to match uq{x) = uo{x — v^kPo) = uo{x). Of course, the 
same strategy may be applied at any time t e [0, T] by investigating {us{x + \/2{Ws — Wt)))se[t,T]- 

With these notations, the monotonicity assumption takes the form: 

Lemma 4.1. Let m and m' he two elements o/P(T'^). For some t e [0,T] and for some 
realization of the noise, denote by m and m' the push-forwards of m and rrf by the mapping 
T'^ 3 X I—>■ x — V^Wt 6 Then, for the given realization of (II4)se[o,t]; 

/ {Ft{x,m) — Ft{x,m'))d{rh — m) ^ 0, / {G{x,m) — G{x,m'){d{m — m) ^ 0. 

Proof. The proof consists of a straightforward change of variable. □ 


Remark 4.2. Below, we shall use quite systematically, without recalling it, the notation tilde ~ 
in order to denote the new coefficients and the new solutions after the random change of variable 
X i-> X + s/^Wt ■ 


4.2 Probabilistic Set-Up 

Throughout the section, we shall use the probabilistic space (P, A, P) equipped with two inde¬ 
pendent d-dimensional Brownian motions and The probability space is assumed 

to be complete. We then denote by the completion of the filtration generated by 

When needed, we shall also use the filtration generated by {Bt)t^o- 
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( 78 ) 


Given an initial distribution mo e we consider the system 

dtrht = {Arht + div{rhtDpHti, Dut))}dt, 
dm = {-Aut + Ht{-, Dut) - Ft{-,mt)]dt + dMt, 

with the initial condition mg = mg and the terminal boundary condition ut = G{-,mT), with 
G{x, mT) = G{x + s/^Wt, rriT)- 

The solution (rit)fg[g^'r] is seen as an (J^t)ig[g ^^j-adapted process with paths in the space 
C'^([0, T], C”(T‘^)), where n is a large enough integer (see the precise statements below). The pro¬ 
cess {'mt)te[o,T] reads as an (J))jg|-g yj-adapted process with paths in the space C*’([0, T], 7’(T'^)), 
7^(T'^) being equipped with the 1-Wasserstein metric di. We shall look for solutions satisfying 

sup (||{it||„+«) e L“(n,^,P), (79) 

tG[o,r] 


for some a e (0,1). 

The process (Mt)^i^^o,T] is seen as an (J))^g[g 2 ’]-adapted process with paths in the space 
C'^([0,T],C’^“^(T‘^)), such that, for any x e T'^, (Mt(x))tg[o,T] is an (Ft)te[o,T] martingale. It is 
required to satisfy 

sup (||Mt|tn_ 2 +a) e L®(n,^,P). (80) 

ts[0,T] 

Notice that, for our purpose, there is no need to discuss of the representation of the martingale 
as a stochastic integral. 

4.3 Solvability of the Stochastic FP/HJB System 

The objective is to discuss the existence and uniqueness of a classical solution to such the 
system (j78|] under the same assumptions as in the deterministic case. Theorem 14.31 below covers 
Theorem 12.91 in Section [2) 

Theorem 4.3. Assume that F, G and H satiny (I26p and m in Subsection \2.3[ Assume 
moreover that, for some integer n ^ 2 and somQ a e [0,1), (HFl(n-l)) and (HGl(n)) hold 
true. 

Then, there exists a unique solution {rht,ut, Mt)te[o,T] ([78]) . with the prescribed initial 
condition niQ = mo, satisfying ([ 7 ^ and (IHOl) . It satisfies sup^gj-g 7i](|['Ui||„+a -I- \\Mt\\n+a-2) £ 
L^{Q,A,F). 

Moreover, we can find a constant C such that, for any two initial conditions mo and mg in 
7’(T'^), we have 

sup [d\{mt,m'f)+ \\ut-u[fn+o) P-a.e., 

tG[o,r] 

where {m,u,M) and {m',u',M') denote the solutions to ([TH]) with mo and mg as initial condi¬ 
tions. 

Theorem 14.31 is the analogue of Propositions [3T] and [3.21 in the deterministic setting, except 
that we do not discuss the time regularity of the solutions (which, as well known in the theory 
of finite dimensional BSDEs, may be a rather difficult question). 

^In most of the analysis, a is assumed to be (strictly) positive, except in this statement where it may be zero. 
Including the case a = 0 allows for a larger range of application of the uniqueness property. 
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The strategy of proof relies on the so-called continuation method. We emphasize that, 
differently from the standard argument that is used in the deterministic case, we will not make 
use of Schauder’s theorem to establish the existence of a solution. The reason is that, in order 
to apply Schauder’s theorem, we would need a compactness criterion on the space on which the 
equilibrium is defined, namely L°°(n,.T,P;C°([0,T],7^(T‘^))). As already noticed in the earlier 
paper [2T], this would ask for a careful (and certainly complicated) discussion on the choice of 
Q and then on the behavior of the solution to ([78]) with respect to the topology put on Q. 

Here the idea is as follows. Given two parameters e [0,1]^, we shall first have a look 

at the parameterized system: 

dtmt = {Arht + div[mt('i9DpHt(-, Dut) -t bt)]}dt, 

f ~ ~ V - ( 81 ) 

dtut = {-Aut + Dut) - wFt{-, mt) + ft]dt + dMt, 

with the initial condition rfiQ = tuq and the terminal boundary condition ut = -I- qt, 

where dr) is some input. 

In the above equation, there are two extreme regimes: when -d = ro = 0 and the input is 
arbitrary, the equation is known to be explicitly solvable; when 'd = tjj = 1 and the input is set 
equal to 0, the above equation fits the original one. This is our precise purpose to prove first, by 
a standard contraction argument, that the equation is solvable when d = 1 and zu = 0 and then 
to propagate existence and uniqueness from the case {d,w) = (1,0) to the case {d^w) = (1,1) 
by means of a continuation argument. 

Throughout the analysis, the assumption of Theorem 14.31 is in force. Generally speaking, 
the inputs {bt)te[o,T] s-iid {ft)tE[o,T] (•Ti)tg[o^r] adapted processes with paths in the space 
C°([0,T], [C^(T'’*)]'^) and C°([0,T],C”'“^(T'’*)) respectively. Similarly, qt is an Jr-measurable 
random variable with realizations in We shall require that 

sup ||6f||i, sup ||/tj| n—l+a? II^^TlIn+o: 

tG[0,T] iG[0,T] 

are bounded (in L®(H,^,P)). 

It is worth mentioning that, whenever : [0, T] x ^ M is a continuous mapping such 
that <^(t, •) 6 for any t 6 [0, T], the mapping [0,T] a t ||y?(t, •)||a is lower semi- 

continuous and, thus, the mapping [0,T] a t sup^gp^q ||y?(t, •)||a is continuous. In partic¬ 
ular, whenever {ft)te[o,T] is a process with paths in C‘^([0, T],C^(T'’^)), for some k ^ 0, the 
quantity supig[o,T] ll/t||fc+a is a random variable, equal to supig[o,T]nQ \\ft\\k+a, and the process 
(sup^gj-Q^q ll/s||A:+a)iE[o,T] has continuous paths. As a byproduct, 

eSSUp^gj^ sup \\ft\\k+a = sup eSSUp^gf^||/t||fc+a- 
te[0,T] te[0,T] 


4.3.1 Case d = zu = 0 

We start with the following simple lemma: 

Lemma 4.4. Assume that d = zu = 0. Then, with the same type of inputs as above, iH) has 
a unique solution Mt)tE[o,T]> with the prescribed initial condition. It satisfies (17^ and 

dHO]). Moreover, there exists a constant C, only depending on n and T, such that 

essup^g!^ sup ||ui||„+Q, ^ C'(essup^gQ||fl'T||n+a + essup^gQ sup \\ft\\n-l+a), (82) 

tG[0,r] ie[0,T] 
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Proof of Lemma \4-4[ When = ro = 0, the forward equation simply reads 

dtifit = {Amt + div[mt6t] }(it, t e [0,T] 

with initial condition mo- This is a standard Kolmogorov equation (with random coefficient) 
which is pathwise solvable. By standard estimates, we have 

di{fhs-,rhs) 

essup^gj^ sup —-^ ^ essup^gnl|o|lc»- 

As d = w = 0, the backward equation in (1811) has the form: 

dtut = {-Aut + ft}dt + dMt, t6[0,r], 

with the terminal boundary condition ut = qt- Although the equation is infinite-dimensional, it 
may be solved in a quite straightforward way. Taking the conditional expectation given s e [0, T] 
in the above equation, we indeed get that any solution should satisfy (provided we can exchange 
differentiation and conditional expectation): 

dtE[ut\Ts] = {-AE[Mt|J( 5 ] +^[ft\J^s]}dt, te [s,r], 

which suggests to let 

rT 

tts(x) = E[u5(x)|J;], Us{x) = Pt-s9t{x) - j Pt-sftix)dt, s 6 [0, T], x 6 (83) 

where P denotes the heat semigroup (but associated with the Laplace operator A instead of 
(1/2)A). For any s e [0, T] and x e T'^, the conditional expectation is uniquely defined up to a 
negligible event under P. We claim that, for any s e [0, T], we can find a version of the conditional 
expectation in such a way that the process [0, T] 3 s (T"^ 3 x Us{x)) reads as a progressively- 
measurable random variable with values in C^([0,T],C^(T'^)). By the representation formula 
(1831) . we indeed have that, P almost surely, u is jointly continuous in time and space. Making 
use of Lemma [4.61 below, we deduce that the realizations of [0, T] 3 s (T'^ 3 x i--> ns(x)) 
belong to C°([0,T],C°(T'^)), the mapping [0,T] x Ll 3 {s,uj) (T'^ 3 x (ms(w))(x)) being 
measurable with respect to the progressive u-field 

^ = {Ae }3{[O,T])0A: Vt e [0,T], A n ([0,t] x LI) 6 B([0,t]) ® J)}. (84) 

By the maximum principle, we can find a constant C, depending on T and d only, such that 

essup^gQ sup llusllo ^ essup^gQ sup ||ms||o ^ C(essup^gQ|| 5 rT||o + essup^gs^ sup \\fs\\o)- 
sg[ 0,T] sG[0,r] O^ssrr 

More generally, taking the representation formula ()83l) at two different x,x' e and then 
making the difference, we get 

essup^gj^ sup llfislla ^ ^(eSSUp^gQllc/TlU PeSSUp^gQ sup \\fs\\a)- 
ss[o,r] s£[o,t] 

We now proceed with the derivatives of higher order. Generally speaking, there are two ways to 
differentiate the representation formula (I83|) . The first one is to say that, for any k e {1,..., n — 
1 }, 

dIus{x)=Pt-s{D'^9t){x)-P t-s{D^Jt){x)dt, (s,x)e [0,r] xT^ (85) 
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which may be established by a standard induction argument. The second way is to make use of 
the regularization property of the heat kernel in order to go one step further, namely, for any 
A: e {1,... ,n}. 


= Pt-s 
= Pt-s 


{D’^gr) (x) - DPt.s {x)dt, 

(Z)V)(x)- r ^ DPt{D^,-^ft+s){x)dt, 
Jo 


(s, x) 6 [0, r] X T'^, 


( 86 ) 


where DPt-s stands for the derivative of the heat semigroup. Equation (1861) is easily derived 
from (|85p . It permits to handle the fact that / is (n — l)-times differentiable only. 

Recalling that \DPt(p\ ^ for any bounded Borel function </? : ^ M and for 

some c ^ 1 independent of ip and of t 6 [0,T], we deduce that, for any k e {!,...,re}, the 
mapping [0,T] x T'^ 9 (s,x) i—> D^Us{x) is continuous. Moreover, we can find a constant C such 
that, for any s e [0, T], 


eSSUp^jgQ II II k+a 




essup^gQ II || k+a 



1 

y/t — S 


ft\\k+a-ldt. 


(87) 


In particular, invoking once again Lemma 14.61 below, we can find a version of the condi¬ 
tional expectation in the representation formula Usix) = E['Us(x)|such that u has paths 
in C°([0,T],C"'(T'^)). For any k e {!,...,re}, D^u is progressively-measurable and, for all 
(s,x) e [0, T] X T'^, it holds that D^Us{x) = E[D^res(x)| 

Using (f87)) . we have, for any k e {1,..., re}. 


essup^gj^ sup II re^ 
sg[0,T] 


k+a ^ C(essup^gj^|| 5 T||fc+a Tessup^gj^ sup ||/s||fc+«-i). 

sg[0,T] 


Now that u has been constructed, it remains to reconstruct the martingale part (Mt)o<ct^T 
in the backward equation of the system (18111 (with iJ = zu = 0 therein). Since u has trajectories 
in C°([0,T],C"+"(T‘^)), re ^ 2, we can let: 

Aus{x)ds — f fs{x)ds, te[0, T], x e T'^. 

Jo 


Mt{x) = ut{x) - uo{x) 


It is then clear that M has trajectories in C^([0,T],C” "^(T“)) and that 


essup^gf; sup Mt „ , _ < 00. 


te[0,T] 


Iln+Q—2 


It thus remains to prove that, for each x e T'^, the process (Mt(x))o<;t<;T is a martingale (starting 
from 0). Clearly, it has continuous and (J^ijosgtigT-adapted paths. Moreover, 


fT fT 

Mt{x) — Mt{x) = gT{x) — ut{x) + / Aus{x)ds — / fs{x)ds, t6[0,T], x 6 T^. 

Jt Jt 


Now, recalling the relationship E[Ares(x)|Jre] = E[Ares(x)|J^], we get 


E[ / Aus{x)ds\Pt] = ^[ [ Aus{x)ds\Pt]- 

Jt Jt 
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Taking the conditional expectation given J'i, we deduce that 


E[Mt{x) - Mt{x)\Tt] =E 


grix) - ut{x) - / fs{x)ds + / Aus{x)ds\Tt 

Jt Jt 


= 0 , 


the second equality following from (l83|) . This shows that Mt{x) = so that the 

process (Mi(x))o^t^T is a martingale, as required. □ 

Remark 4.5. Notice that, alternatively to (1821) . we also have, by Doob’s inequality, 

E[ sup \\ut\\l+o] ^ C^[\\9T\\l+a + sup \\ft\\l+^_i]. (88) 

ts[0,r] te[0,T] 


Lemma 4.6. Consider a random field lA : [0, T] x —> M, with continuous paths (in the variable 

{t,x) e [0,T] X T^), such that 

essup^g^IlZYllo < 00. 

Then, we can find a version of the random field [0,T] x 9 {t,x) >—> E\U{t,x)\Ft\ such that 
[0, T]Bt^ (T'^ 9 X t—> K[l{{t,x)\iFt]) is a progressively-measurable random variable with values 
in C°([0,T],C°(T'^)), the progressive a-field SA being defined in (IHll) . 

More generally, if, for some k ^ 1, the paths of U are k-times differentiable in the space 
variable, the derivatives up to the order k having jointly continuous (in (t, x)) paths and satisfying 


essup^gQ sup \\U{t,-)\\k < OB, 
te[o,T] 

then we can find a version of the random field [0,T] x a (t,x) E\lA{t,x)\Ft\ that is 
proqressively-measurable and that has paths inC^(\0,TfiC^(T'^)), the derivative of orderi writing 
[0,r] xT<^ 3 {t,x) ^E[DiU(t,x)\Ft]. 

Proof. First step. We first prove the first part of the statement (existence of a progressively- 
measurable version with continuous paths). Existence of a differentiable version will be handled 
next. A key fact in the proof is that, the filtration {Ft)te[o,T] being generated by iWt)te[o,T]i any 
martingale with respect to {Ft)te[o,T] admits a continuous version. 

Throughout the proof, we denote by w the (pathwise) modulus of continuity of lA on the 
compact set [0,T] x namely: 


w{S) = sup sup \lA{s,x)—lA{t,y)\, 6 > 0. 

x^ysT^^ilx—yl^S s,tG[0,T]:|t— 

Since essup^gf^||Zi||o < oo, we have, for any <5 > 0, 


essup^^Qw{6) < 00 . 


By Doob’s inequality, we have that, for any integer p ^ 1, 


Ve > 0, 


sup E 

se[0,T] 


w{^]\Fs 




w\ - 
P 


the right-hand side converging to 0 as p tends to oo, thanks to Lebesgue’s dominated convergence 
theorem. Therefore, by a standard application of Borel-Cantelli Lemma, we can find an increas¬ 
ing sequence of integers {ap)p^i such that the sequence (sup^gp E[t(;(l/ap)|J^s])p 5 ,i converges 
to 0 with probability 1. 
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We now come back to the original problem. For any {t,x) e [0,r] x we let 


V{t,x) =E[U{t,x)\Tt]. 


The difficulty comes from the fact that each V(f, x) is uniquely defined up to a negligible set. 
The objective is thus to choose each of these negligible sets in a relevant way. 

Denoting by T a dense countable subset of [0, T] and by T a dense countable subset of 
we can find a negligible event N A such that, outside N, the process [0, T] 3 s E[U{t, x)\Fs\ 
has a continuous version for any t e T and x e X. Modifying the set N if necessary, we have, 
outside N, for any integer p ^ 1, any t,t' and x,x' e X, with \t — t'\ + |x — x'\ ^ l/up, 

sup |E[W(t,x)|J-'s] — x^)|T's]| ^ sup E[t(;(—)l-^s]) 

ss[0,T] se[0,T] 

the right-hand side converging to 0 as p tends to oo. Therefore, by a uniform continuity extension 
argument, it is thus possible to extend continuously, outside N, the mapping T x X 3 {t,x) >—>■ 
([0,T] 9 s 1 -^ E\U{t^x)\Xs\) e C'^([0,T],M) to the entire [0,T] x T'^. For any (f,x) e [0,T] x T'^, 
the value of the extension is a version of the conditional expectation E[W(t, x)|J-'s]. Outside N, 
the slice (s,x) > E[f//(s, xjlJxj] is obviously continuous. Moreover, it satisfies, for all p ^ 1, 

VxjX^eT*^, |x — x^l ^ ^ sup |E[ZY(s, x)IT's] — E[W(s,x')IT's]! ^ sup E[t(;(—)|Ts], 

se[0,T] «s[o,r] 


which says that, for each realization outside N, the functions (T^ 3 x > E[ZY(s, x)|T's])sg[o,T] 
are equicontinuous. Together with the continuity in s, we deduce that, outside N, the function 
[0,r] 3 s 1 -^ (T'^ 3 X ^ E[W(s,xjjT's]) e is continuous. On N, we can arbitrarily let 

"F = 0, which is licit since N has zero probability. Progressive-measurability is then easily checked 
(the fact that V is arbitrarily defined on N does not matter since the filtration is complete). 

Second step. We now handle the second part of the statement (existence of a version). 
By a straightforward induction argument, it suffices to treat the case k = 1. By the first step, 
we already know that the random field [0,T] x 3 (f,x) E[Dxu{t,x)\Ft\ has a continuous 
version. In particular, for any unit vector e e it makes sense to consider the mapping 


X M* 3 {x,h) ^ ^(E[U{t,x + he)\Xt] -E[U{t,x)\Xt]^ -E[(DM{t,x),e}\Xt]. 


Notice that we can find an event of probability 1, on which 
1 


^(E[U{t,x + he)\Xt] -E[U{t,x)\Xt]) - E[<D^Z^(f,x),e>|T't] 

J (^DxU{t,x + Xhe) — DxU{t,x),e^dX\Xt 


(89) 


E 


[DAJ{t,x + Xhe),e')\Ft 


-E 


[DJA{t,x),e')\Tt 


dX 


where we used the fact the mapping [0,r] x T'^ 3 (t, x) E{Dxu{t,x)\Ft\ has continuous paths 
in order to guarantee the integrability of the integrand in the third line. By continuity of the 
paths again, the right hand side tends to 0 with h (uniformly in t and x). □ 

Instead of (|82p . we will sometimes make use of the following: 
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Lemma 4.7. We can find a constant C such that, whenever'd = w = 0, any solution to (I81jl 
satisfies: 

rT 


Mk e 




essup^gQ II 1^5 II /c+Q 


■\Js — t 


ds 


i: 


^ c{ essup^g^llffTlIfc+a + / essup^g^ll/sjlfc+a-ids • 


Proof. Assume that we have a solution to (j81|] . Then, making use of (I87|] in the proof of Lemma 
14.41 we have that, for all A: e {1,..., n} and all s e [0, T], 


essup^gQllusIlfc+Q, ^ Ci eSSUp^gf2||5'T||fc+a T 


r 


eSSUp^gQ II fr II k+a—1 

'Jr - s 


dr . 


(90) 


Dividing by -Js — t for a given t e [0,T], integrating from t to T and modifying the value of C 
if necessary, we deduce that 

rT 


6SSUp^gj-2 II II k-\-a 


\/s — t 


I 

^ C'^essup^gs^||5rr||fc+« + / I 


essup^jgQ ll/rllfc+o—1 
'Js — ty/r — s 


dr 


= C 


I 


eSSUp^gf^llfl'TlIfc+a T / SSSUp^gQ II/j-j|fc_|_Q,_i 


/ 


1 


t 'Js - t'Jr - s 


ds ]dr 


the last line following from Fubini’s theorem. The result easily follows. 
Following (I88ji , we shall use the following variant of Lemma 14.71 


□ 


Lemma 4.8. For p e {1,2}, we can find a constant C such that, whenever id = zu = 0, any 
solution to (ISTI) satisfies, for all t e [0,T].- 


VA: e { 1 ,..., n}, E 


i 


T It,-; IIP 


II k+a 


dsjP't 


^ C 7 E 


I 


5r||^+„+/ ||/r||^+„_icAr| J) 


'Js — t 

Proof. The proof goes along the same lines as that of Lemma 14.71 We start with the following 
variant of (l87j) . that holds, for any s e [0, T], 

rT IpiP 


ll«.||fc+a ^ <^®^^II5 t||^+„ + 

Therefore, for any 0 ^ t ^ s ^ T, we get 


r 


Vr - s 


(91) 


E[II«.|ILJ-^*] ^ ||5T|lLa + 


/ 


T IIIIP 


'■llfc+a-1 


■Jr - s 


dr Ft 


Dividing by Js — t and integrating in s, we get 


E 

Therefore 


/ 


T lU", IIP 


I fc + CK 


Js — t 


ds \Ft 


^ CE 


dTfk+a + I II/> 


E 


I 


T 11^ IIP 


^ II k-\-a 


\/s — t 

which completes the proof. 


ds \Ft 


^ CE 


9 t\\ 1 +^ + J^ \\fr\\l+a.-idr\J^t 


ds ) dr\Ft 


□ 
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4.3.2 A priori estimates 


In the previous paragraph, we handled the case H = w = 0. In order to handle the more general 
case when (??, ro) e [ 0 , 1 ]^, we shall use the following a priori regularity estimate; 

Lemma 4.9. Let ( 6 °) 4 g[o^r] {ft)te[o,T] be (.?^t)te[o,r] adapted processes with paths in the space 
C°([ 0 ,T],C^(T'^,M'^)) andC°([ 0 , r],C"'“^(T‘^)) and gr be an Tt- measurable random variable with 
values in such that 

essup^gj^ sup || 6 ?||i, essup^gj^ sup \\ff\\n+a^i, essup^gs^|| 5 r?,||„+« ^ C, 

f£[0,T] te[0,T] 

for some constant C ^ 0. Then, for any k e {0, ... ,n}, we can find two constants and A^, 
depending upon C, such that, denoting by B the cylinder: 

B:= [weC\[0,T],C^{T'^)):\/ke{0,...,n}, Vte [0,r], ||t(;tj|fc+„ ^ exp(Afc(r - t)) }, 


it holds that, for any integer N ^ 1, any family of adapted processes {nd,vd)i=i,...,N with paths 
in C'^([0, T], P(T'^)) X B, any families {a^)i=i,...,N £ [0,1]'^ and {b^)i=i,...,N £ [0,1]'^ with a^ + 
■ ■ ■ + a^ ^2 and b^ + ■ ■ ■ + b^ ^ 2, and any input {ft)te[o,T] <^i^d gx of the form 


N 


N 


ft = Y,b^Ht{;DiT^)-UFt{-M)\+fl gx = ^ m^) + 5 ^, 


2=1 


2=1 


any solution {rh,u) to (| 8 T]i for some 'd,w e [0,1] has paths in C*’([ 0 , T],'P(T‘^)) x B, that is 

essup^gf^l|uf||fc+« ^ AA:exp(AA:(T -t)), te [0,r]. 

Proof. Consider the source term in the backward equation in (j81h : 


N 

:= mi; Dut) - vaFti; mt) + ^ b'^ti; Dul) - FFti; mj)] + /°. 

2=1 


Then, for any fc e {1,..., n}, we can find a constant Ck and a continuous non-decreasing function 
‘hfc, independent of (m*, id), i = 1,..., N, and of (m, u) (but depending on the inputs ibt)te[o,T]j 
ift)te[o,T] and gx), such that 


I II fc+Q;—1 ^ Cjf 


1 -h 4>fc( l|utl|fc+a-i + max \\ul\\k+a-i) + \\ut\\k+a + max \\ul\\k+Q 


(92) 


When k = 1, the above bound holds true with dJi = 0: It then follows from (HFl(n-l)) and 
from the fact that H (or equivalently Ht) is globally Lipschitz in {x,p) (uniformly in t if dealing 
with Ht instead of iL). When fee {2,...,n}, it follows from the standard Faa di Bruno formula 
for the higher-order derivatives of the composition of two functions (together with the fact that 
DpH is globally bounded and that the higher-order derivatives of H are locally bounded). Faa 
di Bruno’s formula says that each may be chosen as a polynomial function. 

Therefore, by (fMl) and by (IMl) in the proof of Lemma 14.71 (choosing the constant Ck such 
that || 5 ^||fc+a + sup^g-p(']i’d) ||G(-,m)||fc+Q ^ Ck ), we deduce that 


essup^gf2 II hi II ^ Ck 


1-hessup^gf^ sup 4>fc ||us||fc+a-i + max ||u*J|fc+a-i 

sE[o,r] ^ 1=1,...,N 

rT 


1 


n 


-^ressup^gf^||us||fc+«-h essup^gQ max \\ul\\k+a\ds 

ifs — t i=l,...,N 
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Now (independently of the above bound), by (j 92 |) and Lemma 14.71 we can modify Ck in such a 
way that 


r 


eSSUp^^gQ ||'^s||fc+a 


\/s — t 


ds ^ Ck 


s£[0,T] 


1 +essup^gQ sup ^k{\\us\\k+a-i + . max ||u*J|a.+o_i 




i: 


essup^gQp^lIfc+a + essup^gQ max pj[fc+a 

2=1,...,7V 


ds 


so that, collecting the two last inequalities (and allowing the constant Ck to increase from line 
to line). 


eSSUPj^gQ \\Ut II k+a 


^ Ck 


1 + essup^gQ sup llusllfc+a-i + max ||u*||fc+o,_i 
s£[0,T] ^ 1=1,...,N 

, rf II- II , essup^gQmaxi=i,,„,Ar llu^Jlfc+a 

+ J (^essup^gQllusIlfc+Q, H ^J s — t - 


ds 


( 93 ) 


^ Ck 


1 + essup^gQ sup llusllfc+a-i + max ||ii*J|fc+o,_i 

S£[0,T] 




+ / eSSUp^g!^||ud|fc+Q: + 


essup^gj^ maXi=i_,„^Ar sup^g[^^r] \\ul\\k+a 


\/~S^^ 

Now, notice that the last term in the above right-hand side may be rewritten 

rT 


ds 


I 


eSSUp^gQ ®^PrG[s,T] ll^rll^+o: 


y/s — t 


ds 


I 


essup^gQ maxi=i,,„,Ar sup^g[4+^^r] \\uU\k+a 


ds, 


which is clearly non-increasing in t. Returning to (j 93 p . this permits to apply Gronwall’s lemma, 
from which we get: 


essup^jgQ lltij ||^_i_Q, ^ Ck 


l-hessup^gj^ sup ^k[\\us\\k+a-i + max ||u*J|fc+c,_i 
s£[0,T] ^ 2=1,...,7V 


i: 


essup^gf^ maxj=i,.„,Ar sup^g[^ -r] ||u;||fc+ai 
y/s — t 


ds 


( 94 ) 


In particular, if, for any s e [ 0 , T] and any i e { 1 ,... ,N}, essup^g^H 71 * 11 ^+ 0 , ^ exp(Afc(r — s)), 
then, for all t 6 [ 0 , T], 


I 


essup^gj^ maxi=i,,„,Ar sup^g[^^7.] ||u*||fc+a 


y/s — t 


ds ^ A; 


exp(Afc(r - s)) 




r 

Afcexp(Afc(r - 1 )) [ 

Jo 


y/s — t 


ds 


exp(-Afcs) 


( 95 ) 


ds, 
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the passage from the first to the second line following from a change of variable. Write now 


Afcexp(Afc(r - t)) f dg 

Jo VS 

A rrr^ /■®exp(-Afcs) 

= Afcexp(Afc(r - fj) / - - - ds 

Jo ys 

= Ak exp(Afc(r - t)) f 

Jo vs 

^ Afcexp(Afc(r - t)) / 

Jo VS 


Afc 

Afc 

Afc 


T-t 

+ GO 


L 

L 

L 


exp(-Afc(s - (T - t)) 

Vi 

exp(-AA:S) 


ds 


\/T — t + s 
exp(-Afcs) 


ds 


VT 


ds, 


and deduce, from ([M|) and ([95]) . that we can find two constants 7 i(Afc) and 72 (Afc) that tend to 
0 as Afc tend to +oo such that 


essup^gQ||ut||fc+a 




+ essup^gf^essup^g[o,r] (III fc+a-i 


max \\ul\\k+a-i) - Afc7i(Afc) 
2 = 1,....A' 


+ 72 (Afc)Afc exp(Afc(T - t)) 


Choosing A^ first such that 72 (Afc)Cfc ^ 1 and then A^. such that 

1 + essup^gQessup^gp^Tj^fcdl'UsIlfc+a-i + ._max^||{i*||A:+Q,-i) ^ 7i(Afc)Afc, 
we finally get that 


essup^g!^||itt||fc+a ^ Afcexp(Afc(r - t)). 


The proof is easily completed by induction. □ 

4.3.3 Case (??,ro) = (1,0) 

Using a standard contraction argument, we are going to prove: 

Proposition 4.10. Given some adapted inputs ibt)te[o,T]! {ft)te[o,T] and gr satisfying 

essup^gQ sup ||6t||i, essup^gQ sup ||/t||„+a_i, essup^gs^||c/T||n+a < oo, 
is[0,T] 4e[0,T] 

the system dSB), with '3 = 1 and w = 0, admits a unique adapted solution {'mt,ut)te[o,T]> with 
paths in C°([0,T],P(T'^)) x C0([0,r],C’"(T'^)). It satisfies 

eSSUp^gQ sup ||ut||n+a < 00 . 
ts[0,T] 

Proof. Actually, the only difficulty is to solve the backward equation. Once the backward 
equation has been solved, the forward equation may be solved by means of Lemma 14.41 

In order to solve the backward equation, we make use of the Picard fixed point theorem. 
Given an {lFt)te[o,T] adapted process (ut)te[o,T]) with paths in C'^([0,T],C"’(T'^)) and satisfying 
essup^^gQ sup^gj-Q^'P] ||ut||n+a < 00 , we denote by (V) 4 g[o,t] f^ie solution to the backward equation 
in (l8T]l . with 'd = w = t) and with {ft)te[o,T] replaced by {ft + Hfi-, Dut))te[o,T]- By Lemma l4^ 
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the process {u^)te[o,T] belongs to C'^([0, T], C”'(T'^)) and satisfies essup^g^ sup^gj-Q j’] jlu^Hn+a < oo- 
This defines a mapping (with obvious domain and codomain) 

^ : {ut)te[0,T] ^ {K)te[0,T]- 

The point is to exhibit a norm for which it is a contraction. 

Given two adapted datas (ut)tg[o,T]! * = 1)2, with paths in C°([0, T], C"’(T'^)) and with 
essup^gQ supjg[ 0 _T] \\ul\\n+a < 00 , z = 1,2, we call {ut')te[o,T], * = 1,2, the images by T. By 
Lemma [T9] (with = 1, = 1 and = 0), we can find constants {Xk, ^k)k=i,...,n such that 

the cylinder 

^ = {u;eCO([0,r],C-(T'')) :VA;6{0,...,n}, Vie [0,r], ||u;il|fc+„ ^ exp(Afc(r - f))}, 


is stable by T. We shall prove that T is a contraction on B. 

We let wt = u\ — ul and w[ = — uf', for t e [0, T], We notice that 

—dw'^ = [AzZi( — (Vt, Dwt)]dt — dNt, 

with the terminal boundary condition w'rp = 0. Above, (lVt)jg[o,T] is a process with paths in 
C'^([0,T],C"'“^(T'^)) and, for any x e T'^, {Nt{x))te[o,T] is a martingale. Moreover, {Vt)te[o,T] is 
given by 

Vt{x) = [ DpHt[x,rDu^[x) + {1 — r)Du^[x))dr. 
io 

We can find a constant C such that, for any e B, 

sup ||V)||n+a_i ^ C. 

Zs[0,T] 

Therefore, for any u^,u^ 6 B, for any k e {0,..., n — 1} 

Vfe[0 ,T], \\(Vt, Dwt)\\k+a ^ C\\wt\\k+l+a, w\=u^-u^. 

Now, following (f90|) . we deduce that, for any k e {1,..., n}. 


eSSUp^gf2 W'^t II k+a 


^ C 


i: 


essup^gj-2 II Wg i| fc+o 

\^s — t 


ds, 


(96) 


so that, for any fi > 0 

fT 


r f r exp(fit) \ 

j essup^gQ||u;(||fc+Q,exp(/it)cif ^ cy essup^gf^l|u}s|[fc+Q: ( y —j==dt\ds 

/■+* exp(-/is) _ I, ^ 

J -- dsj J essup^gQl|r(;s||fc+a,exp(/xs)ds. 


^ C 


Choosing /x large enough, we easily deduce that T has at most one fixed point in B. Moreover, 
letting = 0 and defining by induction = 'I'(-u*), i 6 N, we easily deduce that, for /x large 
enough, for any i,j e N, 


I 


T ... (J 

essup^gf2||ni+^ - uj||n+aexp(^f)(if ^ 
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so that (modifying the value of C) 

■ C 

eSSUp^gQl|Mi+^ - U\\\n+adt ^ . 

Therefore, by definition of B and by ([^5]) . we deduce that, for any 6 > 0, 

C 

VieN, supessup^gQ sup \\u\^^ - u]\\n+a ^ C^/e + 

jsN ie[0,T] ^ 

from which we deduce that the sequence (u* )jsN converges in C°([0,T],C"'(T‘^))). The 

limit is in B and is a fixed point of T. 

Actually, by Lemma 14.91 (with = 1 and = 0)1 fixed point must be in B, so 

that has a unique fixed point in the whole space. □ 



4.3.4 Stability estimates 


Lemma 4.11. Consider two sets of inputs {b,f,g) and {b',f',g') to (l8T]l . when driven by two 
parameters 'd,we [0,1]. Assume that {m,u) and {rh',u') are associated solutions (with adapted 
paths that take values in C*’([0,T],7^(T‘^)) x C°([0,T],C"'(T'^))j that satisfy the conclusions of 
Lemma \4.l^ with respect to some vectors of constants A = (Ai, ..., A„) and A = (Ai,..., A„). 


Then, we can find a constant C ^ 1, depending on the inputs and the outputs through A and A 
only, such that, provided that 

II 111 ^ 


essup^gj^ sup 
ts[0,T] 


1 

c 


it holds that 


E[ sup \\ut - u'(\l+a + d?(mt,m[)] 
te[0,T] 

^ C'|di(mo,mo)+E[ sup \\bt - b'^l 
te[o,r] 


sup ll/t - //lln+a -1 + I|5T 
te[0,T] 


fl’rlln+o] }■ 


Remark 4.12. The precise knowledge of A and A is crucial in order to make use of the convexity 
assumption of the Hamiltonian. 


The proof relies on the following stochastic integration by parts formula; 

Lemma 4.13. Let (mf)ig[o,T] an adapted process with paths in C*’([0, T], P(T‘^)) such that, 
with n as in the statement of Theorem \4.3\ for any smooth test function ip e C”(T‘^), E almost 
surely. 


Ijd 


ip{x)dmt{x) 


Ijd 


[A(/?(a:) - (/3t{x),Dip{x)}]dmt{x)ydt, te [0,r], 


for some adapted process {/3t)o<^tiiT with paths in C°([0,T], [C°(T'^)]'^). (Notice, by separability 
ofC^{T'^), that the above holds true, P almost surely, for any smooth test function ip e 

Let {ut)te[o^T] uu adapted process with paths in C*’([0, T], C"(T‘^)) such that, for any x e 


dtut{x) ='yt{x)dt + dMt{x), t6[0,r]. 


where (7t)tG[o,r] o,nd {Mt)te[o^T] ^xe adapted processes with paths in C*’([0,T],C‘^(T‘^)) and, for 
any x e {^^t{x))te[o,T] t-s a martingale. 


68 






Assume that 


(97) 


essup^gj^ sup {\\ut\\n + llAllo + hillo + \\Mt\\o) < 00. 


Then, the process 


^ut{x)dmt{x) - j |y^[7s(x) + Aus(x) - (l3s{x), Dus{x))]dms{x)^ds^ 


te[0,T] 


is a continuous martingale. 


Proof. Although slightly technical, the proof is quite standard. Given two reals s < t in [0,T], 
we consider a mesh s = rg < ri < • • • < tat = t of the interval [s, t]. Then, 


/ ut{x)dmt{x) — / Us{x)dms{x) 

Af-lr r- r- 

= S / '<^ri+-,ix)dmri+.^{x)- Uriix)dmriix) 

UTd- JTd- 

V-lr . . 

y! / Uri+dx)dmr^^.,{x)- Ur,^.,{x)dmr,{x) 

JT'i 

N—1 - „ „ 

y. / Uri^,{x)dmr^{x)- Ur^{x)dmr^{x) 
“T LiTd iTd 


i=0 


sn/,i- 


i=0 
N-l 



n+i 


^r{x)dr + Mr (x) - Mr.(x) >(imri(x). 


i=0 


By conditional Fubini’s theorem and by (|97p . 


E 


N-l 




2 / - Mt,(x)}dmt.(x)|7; 


N-l f. 

■ § X- 


SO that 


{E[Mt,_^^(x) - Mt,(x)|T;]}dmi,(x) = 0, 

E[S^\J^s] = 0, 


where we have let 

:= [ ut{x)dmt{x) - [ r(,(x)dm,(x) 


Ijd 

N-l f. 

- 2 / 

N-l 

-E 

i=0 


/Td 


r’i+l 



Td 

r’i+l 


[Ariri+i(x) - (f5r{x),Duri_^_i{x))]dmr{x) Ur 


jr{x)dr Umrj(x). 


/■Jd 


(98) 
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Now, we notice that the sequence converges pointwise to 


5“ := / ut{x)dmt{x) — / Us{x)dms{x) 


Ijd 


Ijd 


[[L [Aur(ic) — (.I3r{x), Durix)} + jr{x)]dmr{x)^dr. 


As the sequence (5''^)7v^i is bounded in L”(r2,A,P), it is straightforward to deduce that, P 
almost surely, 

= lim PrS^I-FJ = 0. 

Af^co 

□ 


We now switch to 


Proof of Lemma Following the deterministic case, the idea is to use the monotonicity 
condition. Using the same duality argument as in the deterministic case, we thus compute by 
means of Lemma 14.131 


dt / (ut - ut)d{rh'i - rht) 

= j--!? JDut,DpHt{-,Du'i-)dm[- DpHt{-,Dut)dmt) 

- [ {Du[ - Dut, b[dm[ - btdmf) + ?? / Dii'f) - Ht{-, Dut))d{m[ - mf) 

-w j {Ft{-,rn[)-Ft{-,mt))d{m[-fht)+l {f[. - ft)d{m[ - mf)\dt 


dMt, 


where (Mi)jg[o,T] is a martingale, with the terminal boundary condition 

/ iu'rp — ut) d{m!rp — rriT) = vj / {G{-,m'rp) — G[-,mT))d{fnrp — friT) 


+ / {g'r - 9T)d{m'rp - ifiT). 

Making use of the convexity and monotonicity assumptions and taking the expectation, we 
can find a constant c > 0, depending on the inputs and the outputs through A and A only, such 
that 


?9cE 


Ijd 


\Du[ — Dut\^d(^rht + rh'^ 


dt 


^ lln'o - uo||idi(mo,mo) + E[||fi'T “ 5T||idi(mr, m^)] 


(99) 


E 


\\b[ - btWoWu'i- - utWidt + E f {\\(bt,Du'i- - Dut)\\i + \\fl - ft\\i)di{mt,m[)dt. 

Jo 


We now implement the same strategy as in the proof of Proposition 13.21 in the deterministic 
case. Following (@2]), we get that there exists a constant G, depending upon T, the Lipschitz 
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constant of DpH and the parameters A and A, such that 

sup di{'m[,mt) 
te[0,T] 


^ Cl di(mQ,?fio) + sup \\b[ — btWo +'& / / \Du'^ — Dus\d{rhs + 

V te[o,T] Jo UTd. 


ds 


(100) 


which holds pathwise. 

Taking the square and the expectation and then plugging we deduce that, for any small 
r] > 0 and for a possibly new value of C, 


E[supd^(mt,m()] ^ cl?/ ^d^(mo, uiq) + r/E[ sup j[ut — 


te[0,T] 

+ r]~^essup^^Q sup ||64 ||iE[ sup ||ut —UtHi] 
tG[0,T] te[0,T] 

+ 7?"^E[sup \\bt-bt\\l+ sup ll/i -/t'll? + ||5T-5rlli]}- 
te[0,T] ie[0,T] '' 


( 101 ) 


Following the deterministic case, we let wt = ut — u^, for t e [0,T], so that 

- dwt = [Awt - iJ(Vt, Dwt) + ^R\ - {ft - fi)]dt - dNt, (102) 

with the terminal boundary condition wt = wIV + g'j' — gx- Above, {Nt)te[o,T] is a process with 
paths inC°([0,T],C°(T'^)), with essup^g^^ sup^gi-o^T] ll^illo < oo, and, for any x e T"*, {Nt{x))te[o,T] 
is a martingale. Moreover, the coefficients {Vt)te[o,T]^ (-^t)ts[o,T] and ff" are given by 

Vt{x) = [ DpHt{x,rDu{x) + (1 — r)Du'{xf) dr, 

Jo 

rl 

/(x) = / -r—(x, rmt + (1 — r)ml) (mt — mj) dr, 

Jo 

(x) = / - — {x,rmT + {^ — r)m'rp){mT — fnT)dr. 

Jo 


R 


R 


Following the deterministic case, we have 


sup \\Rl\\n+a-l + \\R^\\n+a ^ C SUp di{mt,m{). 

ts[0,T] ie[0,T] 


(103) 


Moreover, recalling that the outputs u and u' are assumed to satisfy the conclusion of Lemma 
Ml we deduce that 

sup IIV) IIjj-i-Q.—! ^ C. 

te[0,T] 

In particular, for any k e {0,... , n — 1} 

Vf e [0,r], \\(Vt, Dwt)\\k+a ^ C'||u)t||fc+l+a. 

Now, following ([9T]1 and implementing (I103p . we get, for any t e [0,r], 

\\kXt\\k+a 


< E 


IlffT ~ gxWk+a + 


r 


^ CE 


Ws^ ds + sup Wfs - f's\\k+a-i + sup di (m^, m'J | J) 
y/S-t sg[0,T] se[0,r] 

gx-gx\\k+a+ / \\Ws\\k+ads+ SUp ||/* - /'j| fc+a-1 + sup di(ms, m'J | J) 
Jt ssro.Tl ssro.Tl 
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the second line following from Lemma 14.81 (with p = 1). By Doob’s inequality, we deduce that 


E[ sup \\Ws\\l+a] 

se[t,T] 


^ E 


gr - grWk+a 



Ws\\l+ads+ sup 11 /^ 
ss[0,T] 


f'f 

^ s II k+a—1 


sup dl{rhs,mg) 
s£[0,T] 


By Gronwall’s lemma, we deduce that, for any k e {1,..., n}, 


E[ sup ^ CE \\gT-gT\\l+a+ sup \\ft-fl\\l+a-l 

ts[0,T] L ts[0,T] 


sup 

te[0,T] 


(104) 


We finally go back to (|10ip . Choosing r] small enough and assuming that essup^^gf^||6t||i is also 
small enough, we finally obtain (modifying the constant C): 


E[ sup {\\ut-u[\\l+^ + dl{mt,m[))] 

te[0,T] 

^ C'{d?(mo,mo)+E[ sup Wh - b[\\l + sup ||/t - //!ln+a-i + lls-T - fi'Tlln+ J 
ts[0,r] te[0,T] '' 


which completes the proof. 


□ 


4.3.5 Proof of Theorem 14.31 

We now end up the proof of Theorem 14.81 

First step. We first notice that the stability estimate in the statement is a direct conse¬ 
quence of Lemma 14.91 (in order to bound the solutions) and of Lemma 14.111 (in order to get the 
stability estimate itself), provided that existence and uniqueness hold true. 

Second step (a). We now prove that, given an initial condition mo e P(T'^), the system (1781) 
is uniquely solvable. 

The strategy consists in increasing inductively the value of ro, step by step, from ro = 0 to 
tu = 1, and to prove, at each step, that existence and uniqueness hold true. At each step of 
the induction, the strategy relies on a fixed point argument. It works as follows. Given some 
w e [0,1), we assume that, for any input (f,g) in a certain class, we can (uniquely) solve (in 
the same sense as in the statement of Theorem 14.3p 

dtrht = {Amt + div[mtBpHt(-,Dut)]}dt, . . 

_ _ _ (105) 

dtut = {-Aut + Ht{-, Dut) - ^Ft{-,mt) + ft}dt + dMt, 

with mo = mo as initial condition and ut = 'cuG{-,mT) + gT as boundary condition. Then, the 
objective is to prove that the same holds true for zu replaced by tu -I- e, for e > 0 small enough 
(independent of ro). Freezing an input {f,g) in the admissible class, the point is to show that 
the mapping 


^ : {mt)te[o,T] 


{ft = -eFt(-,mi) 

gr = eG{-,mT) + gr 


{dd't)telO,T], 


is a contraction on the space of adapted processes {'rht)te[o,T] with paths in C°([0,T],'P(T‘^)), 
where the last output is given as the forward component of the solution of the system (|105ll . 
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The value of w being given, we assume that the input (/, g') is of the form 

N N 

ft = -Y,b^Ft{-M), 9T = (106) 

i=l 

where N ^ 1, 6^,... ,6^ ^ 0, with e + + ■ ■ ■ + ^ 2, and (or equivalently 

(m*)j=i_,,,^Ar) is a family of N adapted processes with paths in C^([0, T], P(T'^)). 

The input ((/t)te[o,T])5T) being given, we consider two adapted processes aiid 

{'mf^)te[o,T] with paths in C°([0,r],T’(T‘^)) (or equivalently {m[^^)te[o,T] and im[^^)te[o,T] without 
the push-forwards by each of the mappings (T'^ 3 x i—* x — V2Wt € T'^)te[o,T], c/. Remark [12]), 
and we let 

+ fu [0,T]; Qt = -eG{-,mP) + gr] i = 1,2. 

and 

Tfjh') = <h(m^*^), z = 1,2. 

Second step (b). By Lemma Ill9l we can find positive constants {Xk)k=o,...,n and {Ak)k=o,...,n 
such that, whenever {rht,ut)te[o,T] solves (|105l) with respect to an input {{ft)te[o,T]^9T) of the 
same type as in (11061) . then 

V/se {0,... ,n}, Vte[0,r], essup^gs^||ut|| fc+a ^ AA,exp(Afc(r -t)). 

It is worth mentioning that the values of {Xk)k=o,...,n and {A.k)k=o,...,n are somehow universal 

in the sense that they depend neither on w nor on the precise shape of the inputs {f,g) when 
taken in the class (|106l) . In particular, any output ini^)te[o,T] of the mapping <I> must satisfy the 
same bound. 

Second step (c). We apply Lemma Pd. Ill with 6 = 6' = 0, (/t,//)os;ts:r = (/i^\/i ^^)os:t^T and 
{9T,9f) = deduce that 

E[ sup )] 

te[0,T] 

^ e^C'{E[ sup ||Ft(-,mJ^^) - Ft(-,mf-h ||Gr(-,m^^^) - Gr(-,m®)||2+J j, 
ts[0,T] ^ 

the constant G being independent of w and of the precise shape of the input (/, 5 ) in the class 
()106[) . Up to a modification of C, we deduce that 

E[ sup ^ e^C'E[ sup d^(?fi|^\ , 

te[0,T] te[0,T] 

which shows that <I> is a contraction on the space L^(n, aI,P;C^([ 0,T],'P(T'^))), when e is small 
enough (independently of w and of [f,g) in the class ()106|) 1. By Picard fixed point theorem, we 
deduce that the system (|105p is solvable when w is replaced by ro -(- e (and for the same input 
(/, 5 ) in the class (11061) 1. By Lemma lT9l and Proposition I4.11[ the solution must be unique. 

Third step. We finally establish the L°° version of the stability estimates. The trick is 
to derive the estimate from the version of the stability estimates, which seems rather 
surprising at first sight but which is quite standard in the theory of backward SDEs. 
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The starting point is to notice that the expectation in the proof of the version permits to 
get rid of the martingale part when applying Ito’s formula in the proof of Lemma 14.111 (see for 
instance (|M1) L Actually, it would suffice to use the conditional expectation given Tq in order to 
get rid of it, which means that the estimate may be written as 

E[ sup (df(mf,m() + \\ut - u't\\l+a)\J'o] ^ Cdl{mo,mo), 

te[0,T] 

which holds P almost surely. Of course, when mo and mg are deterministic the above conditional 
bound does not say anything more in comparison with the original one: When mg and mg are 
deterministic, the cr-field J-q contains no information and is almost surely trivial. Actually, the 
inequality is especially meaningful when the initial time 0 is replaced by another time t e (0, T], 
in which case the initial conditions become rht and m( and are thus random. The trick is thus to 
say that the same inequality as above holds with any time t e [0,T] as initial condition instead 
of 0. This proves that 

E[ sup (di(m 5 ,mj + - K\\‘i+a)\^t] ^ Cdi(m 4 ,m(). 

se[t,T] 

Since \\ut — u'^Wn+a is J'i-measurable, we deduce that 

\\ut - u[\\n+a ^ Cdi(mt,m(). 

Plugging the above bound in (|100jl . we deduce that (modifying C if necessary) 

sup di(mt,m() ^ Cdi(mo,mo). 

ts[0,T] 


Collecting the two last bounds, the proof is easily completed. 


□ 


4.4 Linearization 

Assumption. Throughout the paragraph, a stands for a Holder exponent in (0,1). 

The purpose here is to follow Subsection 13.31 and to discuss the following linearized version 
of the system ([78|) : 


(107) 


dtzt = {-Azt + (Vt{-),Dzt} - + ft}dt + dMt, 

dipt - Apt - div(ptVt) - d\Y{rhtVtDzt + b^) = 0, 
with a boundary condition of the form 

dG , \i \ ~o 

ZT = 

where [Mt)te[Q,T] is the so-called martingale part of the backward equation, that is [Mt)ts[o,T] 
is an (T't)jg|-g T^j-adapted process with paths in the space C°([0,T],C‘^(T‘^)), such that, for any 
X e T'^, {Mt{x))te\o^T] is an (Ji)tG[o,r] martingale. 

Remark 4.14. Above, we used the same convention as in Remark^J^ Por {pt)te[o,T] with paths 
in CO([0,r],(C"(T''))') for some k ^ 0, we let {pt)te[o,T] I'de distributional-valued random 
function with paths in C*’([0, T], (C^(T'^))') defined by 
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Generally speaking, the framework is the same as that used in Subsection 13.31 namely we 
can find a constant C ^ 1 such that: 

1. The initial condition po = po takes values in for some o! e (0, a), and, unless 

it is explicitly stated, it is deterministic. 

2. (Gt)ts[o,r] is an adapted process with paths in C‘^([0,T],C”(T'^,with 

eSSUp^gQ sup \\Vt\\n+a ^ C. 

te[0,T] 

3- (h^t)ts[o,r] is an adapted process with paths in ^*^([0,T],P(T'^)). 

4. (ri)jg[o,T] is an adapted process with paths in C°([0, T], such that, with prob¬ 

ability 1, 


sup llTtlli ^ C, 

te[0,T] 


V(t,x) 6 [0,T] X T'^, 


c-‘/j« r,{x) s c/j. 


5. (S?)*[0. T] is an adapted process with paths in C°([0,T], ^(T‘^))']'^), and {ft)te[o,T] 

is an adapted process with paths in C°([0, T],C”(T'^)), with 

essup^gf^ sup {\\b^\\_^n+a'-l) + \\f?\\n+a) < 00. 
te[ 0 ,r] 

6. is an J-^-measurable random variable with values in with 

essup^gQll^riU+i+a < 00. 


Here is the analogue of Lemma 13.51 

Theorem 4.15. Under the assumption (1-6) right above and (HFl(n)) and (HGl(n-|-l)), 
for n ^ 2 and /3 e (a', a), the system (1107^ admits a unique solution {p,z^M), adapted with 
respect to the filtration (Ti) 4 g[o,T]; paths in the space C°([0,T], (C”+i^(T'^))' x x 

Cn+d{jd^) and with essup^ supjg[o^-r] (||pt|1_(„+^) -h \\zt\\n+i+p + \\Mt\\n-i+p) < oo. It satisfies 

eSSUp^gQ sup {\\pt\\-(n+a') + \\h\\n+l+a + \\Mt\\n+a-l) < 00 . 
te[0,T] 

The proof imitates that one of Theorem 14.31 and relies on a continuation argument. For a 
parameter t) e [0,1], we consider the system 


dtzt = {-A5t -h (Vt{-),Dzt) - '&-^{-,mt){pt) + ft}dt + dMt, 

dtpt - - d\Y[ptVt) - divlfilrhtTtDzt + V)) = 0 , 

with the boundary conditions 

SG 

Po = Po, ZT ='6 j^{-,mT){pT) + gr¬ 


im 


(109) 
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As above the goal is to prove, by increasing step by step the value of that the system (jl08j) . 
with the boundary condition (11091) . has a unique solution for any e [0,1]. 

Following the discussion after Theorem 14.31 notice that, whenever {bt)te[o,T] is a process 
with paths in ^*^([0, T], (T'^)), for some (3 e (a', a), the quantity sup^gj-gj^] ||6t||_(„+„/) is 

a random variable, equal to sup^gj-g \\bt\\-(^n+a')- Moreover, 

essup^gQ sup \\bt\\-(n+a') = sup essup^gs^||6t|l_(„+„q. 
te[0,T] te[0,T] 

Below, we often omit the process {Mt)te[o,T] when denoting a solution, namely we often write 
{pt, zt)te[o^T] instead of (pt, zt, Mt)te[o,T] so that the backward component is understood implic¬ 
itly. We feel that the rule is quite clear now: In a systematic way, the martingale component 
has two degrees of regularity less than {zt)te[o,T]- 

Throughout the subsection, we assume that the assumption of Theorem 14.151 is in force. 


4.4.1 Case i? = 0 
We start with the case ?? = 0: 

Lemma 4.16. Assume that = 0 in the system (llOSj) with the boundary condition (llOQj) . Then, 
for any /3 e (a', a), there is a unique solution {p, z), adapted with respect to [3Ft)te[o,T]; with paths 
in C°([0, T], (C^+^(T'^))' x C”+^+^(T'^))) and with essup^ supig[g_r](||pt||_(„+^) -h ||zt||„+i+; 3 ) < oo. 
Moreover, we can find a constant C, only depending upon C, the bounds in (HFl(n)) and 
(HGl(n+l)), T and d, such that 

eSSUp^gQ sup ll/ij II^ C (IIpoII — (n+a') T eSSUp^^gj^ sup IIII , 
te[0,T] ie[0,T] 

essup^gj^ sup pt||„+i+„ ^ C'(essup^gQ||ffy|U+i+a-h essup^gj^ sup ||/°||„+„). 
te[0,T] ie[0,T] 

Proof. When i? = 0, there is no nonlinearity in the equation and it simply reads 

(i) dtzt = {-Azt + (Vt{-),Dzt) + ft}dt + dMt, 

(ii) dtpt - Apt - div(piFt) - div(6^) = 0, 


( 110 ) 


with the boundary condition /5g = pg and zt = g^- 

First step. Let us first consider the forward equation (IllOK iill. We notice that, whenever pg 
and {bt)te[o,T] are smooth in the space variable, the forward equation may be solved pathwise in 
the classical sense. Then, by the same duality technique as in Lemma 13.51 (with the restriction 
that the role played by n in the statement of Lemma 13.51 is now played by re — 1 and that the 
coefficients c and b in the statement of Lemma 13.51 are now respectively denoted by b^ and /®), 
for any fie [a',a], it holds, P almost surely, that 


sup ||pf||_(„+p) ^ C"(||pg||_(„+p) -h sup ||6°||_(„_i+p)). 
te[0,T] te[0,r] 


( 111 ) 


Whenever pg and (&i)te[o,T] are not smooth but take values in (C"'''“'(T‘^))' and (C’^'''"'“^(T‘^))' 
only, we can mollify them by a standard convolution argument. Denoting the mollified sequences 
by (p^)Ar^i and {{iit'^)te[o,T])N^i-, h is standard to check that, for any fi e (a', a), P almost 
surely, 

lim (||p^-po||_(„+p)-f sup ||6°’^ - 6f||_(„_i+p)) = 0, (112) 

ts[0,T] 


W —>+00 
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from which, together with (jllip . we deduce that, P almost surely, the sequence {{p^)te[o,T])N^i 
is Cauchy in the space C([0,T], (C”+^(T'^))'), where each {p^)te[o,T] denotes the solution of the 
forward equation (jllOI -liili with inputs {Pq , (^i’^)ts[o,T])- With probability 1 under P, the limit 
of the Cauchy sequence belongs to C([0,T], and satisfies (jllll) . Pathwise, it solves 

the forward equation. 

Note that the duality techniques of Lemma 13.51 are valid for any solution {pt)te[o,T] of the 
forward equation in (|110l -fiilL with paths in C^([0, T], This proves uniqueness to 

the forward equation. 

Finally, it is plain that the solution is adapted with respect to the filtration The 

reason is that the solutions are constructed as limits of Cauchy sequences, which may be shown 
to be adapted by means of a Duhamel type formula. 

Second step. For the backward component of (IllOp . we can adapt Proposition 14.101 the 
solution is adapted, has paths in C*^([0, T], for any (3 e (a', a), and, following (1821) . 

it satisfies: 

eSSUp^gQ sup pt||„+i+a ^ C"(eSSUp^gj^|IffrlU+l+« + eSSUp^gQ sup \\f°\\n+a), 

OjgtjgT te[0,T] 

which completes the proof. □ 


4.4.2 Stability argument 

The purpose is now to increase d step by step in order to prove that (I108p - (I109I) has a unique 
solution. 

We start with the following consequence of Lemma 14.161 


Lemma 4.17. Given some d e [0,1], an initial condition pQ in (T'^))', a set of coeffi¬ 

cients (Pi, m-i, Ft)jg[o,T] in points 2, 3 and 4 of the introduction of Suhsection \4.4\ o-nd a set of 
inputs {(b^Jj!) is[o,T],5T) in points 5 and 6 of the introduction of Suhsection \4.4\ consider a 
solution (pt, zt)te[o,T] of the system (|108l) with the boundary condition (llOOp . the solution being 
adapted with respect to the filtration having paths in the space C°([0, T], x 

C°([0,r],C’^+^+^(T'^)), for some P e {a', a), and satisfying essup^g^ supig[o_ 7 .](||pi|l_(„+, 3 ) + 

il■S^I!n+l+/3) ^ 

Then, 


essup^gQ sup 
i£[o,r] 


Pt\\-{n+a') + 


|| n+l + Q : 


< 00 . 


Proof. Given a solution (pt, zt)te[o,T] in the statement, we let 


b^t = 


6? 


'drhtTfDz, 


ft 


.SFi 


SG 


=/t - i?—(•,mi)(pi), t6[0,T]; gf = gf+'&—{■, mT){pT) 


6m 


6m 


Taking benefit from the assumption (HFl(n)), we can check that (&t)ig[o,T]) {ft)te[o,T] and g^ 
satisfy the same assumptions as (6i)ts[o,T], ift)te[o,T] and g^ in the introduction of Subsection 
14.41 The result then follows from Lemma 14.161 □ 


The strategy now relies on a new stability argument, which is the analog of Lemma 14.111 

Proposition 4.18. Given some d e [0,1], two initial conditions po and p'q in 

two sets of coefficients {Vt,mt,Tt)te[o,T] ond (Pt^ hr], r[) 4 g[o,T] os in points 2, 3 and 4 of the 
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introduction of Subsection \4.4\ and two sets of inputs ((&t,/°)te[o,T] > ^t) ^ ft')te[o,T]y 9 t) 

as in points 5 and 6 of the introduction of Subsection \4-4\ consider two solutions [pt-,zt)te[o,T] 
and [p'f,z't)te[o,T] of the system (jlOSp with the boundary condition (IIOQI) . both being adapted 
with respect to the filtration having paths in the spaee C*’([0, T], x 

C°([0,T],C’*+^+^(T'^)), for some (3 e {a', a), and satisfying 

eSSUp^gQ sup [\\pt\\-(n+P) + Pt||n+l+/3 + \\Pt\\-{n+P) + Ptlln+H-/?) < 00- 
te[0,T] 


Then, it holds that 


E 


sup 

\\zt - 

z'tl 

UG[o,r] 


^ c'l 

Wpo - 

Po 

-7 E 

sup 

fb 


Ue[ 0 ,r] 

-7 

sup 

{(1 


iG[0,T] 



-'ll 2 


tG[o,r] 


:o i:o'ii 2 


ZtWn+l+a 


70/II2 


;; 0/||2 


ts[0,T] 


+ \\p'tf-i^n+a')) {\\Vt - + iir* - r[\\l)] I 


the constant C only depending upon C in the introduction of Subsection \4-4i T, d, a and a'. 

Proof. First step. The first step is to make use of a duality argument. 

We start with the case when po, pg, 6° and b^' are smooth. Letting 6° = 'dfhtTtDzt + b^ 
and b^' = ^rh[T[Dz[ + b^', for t e [0,T], we notice that {pt)te[o,T] and {pt)te[o,T] solve the 
linear equation (ii) in pTO]) with (b^)te[o,T] and (6?')ts[o,T] replaced by {bf)te[o,T] and (b^')te[o,T] 
respectively. By Lemma 14.161 with (6°)fg[o,T] in (|110P equal to )fE[o,T] and with n in the 
statement of Lemma 14.161 replaced by n — 1, we deduce that {pt)te[o,T] and {pt)tG[o,T] have 
bounded paths in C°([0, T], (C"“^+^(T^))'), for the same (3 e (a', a) as in the statement of 
Proposition 14.181 

With a suitable adaptation of Lemma 14.131 and with the same kind of notations as in Sub¬ 
section [331 this permits to expand the infinitesimal variation of the duality bracket fzt — z[,pt — 
p't)Xn,X'.,,i with Xn = We compute 


dt{^t-z't,Pt-p't)x^,x',, 



where is a martingale and where we applied Remark 14.141 to define {pt)te[o,T] and 

iPt)te[o,T]- Ari important fact in the proof is that the martingale part in (IllOp has contin¬ 
uous paths in C°([0,T],which permits to give a sense to the duality bracket 
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(in x) with (pt — p'i)te[o,T]i since {pt — p't)te[o,T] is here assumed to have continuous paths in 
C°([0, T], Similarly, the duality bracket of {zt — -St)te[o,T] with the Laplacian of 

{Pt — p't)te[o,T] makes sense and, conversely, the duality bracket of {pt)te[o,T] with the Laplacian 
of {zt — -Si)te[o,T] makes sense as well, the two of them canceling with one another. 

Of course, the goal is to relax the smoothness assumption made on pQ, pg, 6° and W'. 
Although it was pretty straightforward to do in the deterministic case, it is more difficult here 
because of the additional martingale term. As already mentioned, the martingale term is defined 
as a duality bracket between a path with values in C^([0, T], and a path with values 

in C*^([0, T], Of course, the problem is that this is no more true in the general 

case that {pt — p't)te[o,T] has paths in C°([0,T],In order to circumvent the 
difficulty, a way is to take first the expectation in order to cancel the martingale part and then 
to relax the smoothness conditions. Taking the expectation in the above formula, we get (in the 
mollified setting); 


^¥{{h-z[rpt-p't')^^^x'} 

= [-V.\(D{~zt - ~z'^)rp't{Vt-Vl) 
E (f?-f7,pt-p't 

6Ft 


E 


-E 


D~z[,{Vt-Vl){pt-p[) 


Xr.,X'J 




6m 




x„,x^ 

{■,mt){pt- pt),pt- p't 

+ E 

(^D[zt - z[),rhtTtD{zt - z[) 


D{zt-~z[)M-Vl' 








E 


D{zt - z[), {mtTt - m[T[)Dz[ 


(113) 




Whenever pQ, pg, and are not smooth (and thus just satisfy the assumption in the statement 
of Proposition 14.18]) . we can mollify them in the same way as in the first step of Lemma f4.16[ We 
call (/0^)p^i, (/Oo’^)p 5 :i 5 and (b^'’^)p^i the mollifying sequences. For any (3’ e {a', a) 

and P almost surely, the two sequences respectively converge to po and pg in norm || • ||_(„_|_^/) and 
the two last ones respectively converge to 6^ and b^' in norm || • uniformly in t e [0, T]. 

With {pt, -?t)te[o,T] and (p(, -S()te[o,T] the original solutions given by the statement of Proposition 
14.181 we denote, for each A" ^ 1, by {p^, z^)te[o,T] and (Pt -St’^)ts[o,T] the respective solutions 
to (|110l) . but with {b^, , g^)te[o,T] respectively replaced by 


(bt'^ = + '&rhtTtDzt,ff = ft - '&^{-,mt){pt),gT = Pt + , 

V dm dm me[o,T] 

and = 6°'’^ + ^m'tT'tDz'tJ^' = /f - ^^{■,m't){pt), g^ = g^ + 


By linearity of (IllOp and by Lemma [4. 161 we have that {p^)n^i and {Pt^)N^i converge to pt 
and Pt in norm || • uniformly in f e [0,T], and that {z^)n^i and {zt^)N^i converge to 

Zt and z't in norm || • \\n+i+p, uniformly in f 6 [0, T]. 

Then, we may write down the analogue of (I113p for any mollified solution {p^, z^)n^i (pay 
attention that the formulation of (|113|) for the mollified solutions is slightly different since the 
mollified solutions only satisfy an approximating version of (llOSp i. Following (I112p . we can 
pass to the limit under the symbol E. By Lemma 14.161 we can easily exchange the almost sure 
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convergence and the symbol E, proving that the identity (jll.Sp holds true under the standing 
assumption on po, Po>, (&°)te[o,r] and (bf)ts[o,T]- 

Using the convexity of F and the monotonicity of F, we deduce that 


E 


(zt-z't,Pt-Pt}x^,{x„Y 


c-^m 


ijd 


\D(^Zs — Zg)\‘^drhs ]ds 


< 


^[(zo - Zq, Po - +C'E ^ &[\\ps - PsW-in+a') + \\Zs - Z^Wn+l+a) ds 


where 


0 : = 


\\P0 — Po\-{n+a') + WPt ~ drWn+l+a 


sup 

s£[0,T] 


11 /°-/: 


s \\n+a 


+ Wb^.-b^JW- 


s II—(n+a'—1) 


+ (Psl|n+l+« + l|p/|-(n+«'))(ll^^ - KlU+a + di{ms,m'^) + 



Recalling that 


(ZT - zt,pt - 

^ -C'QWpt - PT\\-(n+a'), 

where we have used the monotonicity of G to deduce the second line, we thus get 


??E 


Jo \JTd. 

^ C'E 


D(^Zs — z'g^\ drhsjds 
0 ^ 11^0 — Zglln+l+a + \\pT — PT\\-(n+a') 

)ds 


(114) 


+ / {WPs — P's\\-{n+a') + W^s — K\\n+1 + 

Jo 


Second step. As a second step, we follow the strategy used in the deterministic case in order 
to estimate {\\pt — Pt\\-{n+a'))te[o,T] ia terms of \D{zs — z'g')\‘^drhs)ds in the left-hand side 

of (fTTHl . 

We use again a duality argument. Given ^ e and r e [0,T], we consider the 

solution (u)t)te[o,T]) paths in C°([0,r],C"'+^(T‘^)), to the backward PDE: 


Stwt = {-Awt -f (Vt{-),Dwt)}, 


(115) 


with the terminal boundary condition Wr = Pay attention that the solution is not adapted. 
It satisfies (see the proof in the last step below), with probability 1, 


Vie [0,r], 
\fte [0,r), 


'a’illn+a' ^ 

lln+l+o' 


^ IlClln+o') 

C' „ 


< 


\/t — t 


^lln+a'- 


(116) 
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Then, letting and following the end of the proof of Lemma 1531 we have 

dt(wt,pt — Pt)x~,{x-y 


- d(^Dwt,mtTtD{zt - z't)\ ,^-dt - id/Dwt, {mtTt - m'tr[)Dz't\ 


so that 


{?)Pt Pt)x~,{X~)' ^ ^ Il'ClIn+o' 


0 + 7? 


0 \?T'^ 


1/2 

\D{zs — z'g)\ drhs ) ds 


Therefore, 


^ c 


WPt PrW—in+a') 
Plugging (I117P into (|114p . we obtain 

pT 

7?E 

' Ifd 

Therefore, 


/ rT r \ 1/2 

Q + '&^J J \D{zs — z'g)\‘^dmsj ds 


(117) 


^ (^J \D{zs-z',)fdThsjds ^C'Ee(^ 


0(0+ sup \\zt-zl\\n+l+a 

te[ 0 ,T] 


e[ sup \\k-MUn+c 
Lts[0,T] 


II2 


^ C'E 


0(0+ sup \\zt - Z^Wn+l+a] 
^ te[0,T] ' 


(118) 


(119) 


Third step. We now combine the two first steps to get an estimate of {\zt — z[\\n+i+a)te[o,T]- 
Following the proof of (|104l) on the linear equation (11021) and using the assumptions (HFl(n)) 
and (HGl(n+l)), we get that 


E[ sup^ 11^4 - 5l||^+i+J ^ E 02 + ||pr - PTll-(n+aO + \\ps “ p'A-(n+a'/s 


te[0,T] 


( 120 ) 


By (11191) . we easily complete the proof. 

It just remains to prove (jllOp . The first line follows from Lemma 13.31 The second line may 
be proved as follows. Following ([Mp . we have, with probability 1, 


Vte[0,r), llTUtjl^+i+a'^ C" 


JU\ 


n+a' r 

-t Jt 


IllHslln+l+a' 
y/s — t 


ds . 


( 121 ) 


Integrating and allowing the constant C to increase from line to line, we have, for all t e [0, r), 

f ll'^siln+l+o' 


J t 

y/s — t 

^ c 

1 ? |n+Q( 

^ c 

-1 

+ 

P 


-ds 


[ 


I 


y/r — sy/s — t 


ds + i: \\Wr\\n+l+a' 


1 


ds ]dr 


y/r — sy/s — t J 


I 


n-\-a' II lln+l+a;' 
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Plugging the above estimate into (I121j) . we get that 


Vt 6 [0, r), sJt — t 
which yields, by Gronwall’s lemma, 

Vt £ [0,T), Il'ihtjlfl-l-l-l-Q;' 
which is the required bound. 


Il^t lln+l+a' 


lie 


n-\-a' 



T ^Il'W^r lln+l+a' 


dr 


C ||.|| 

/- 7 lls \\n+a‘ 

\/r -t 


□ 


4.4.3 A priori estimate 

A typical example of application of Proposition 14. 181 is to choose: = 0, {W', f^', g^') = (0,0,0), 

V = V', r = re in which case 

(pe^') = (0,0). 

Then, Proposition 14. 181 provides an a priori estimate of the solutions to ()107l) . (Pay attention 
that the constant C in the statement depends upon the smoothness assumptions satisfied by 
V.) The following corollary shows that the bound can be turned into an L°° bound. It reads 
as extension of Lemma 14.161 to the case when ?? may be non zero: 

Corollary 4.19. Given id £ [0,1], an initial condition pq in (C"+"^(T‘^))' and a set of inputs 
((^?)/t*)ts[o,r]) ffr) points 1-6 in the introduction of Subsection \4-f\ consider an adapted 

solution {pt, zt)tE[o,T] of the system (I108p - (110911 . with paths in the space C*^([0, T], (C”+1^(T'^))') x 
C°([0,T],C’*'‘'^'''^(T'^)) for some {3 e (a', a), such that 


essup^gj^ sup {\\Zt\\n+l+P + \\Pt\\-(n+P)j < ^C. 
ts[o,r] ^ ^ 

Then, we can find a constant C, only depending upon C, T, d, a and a', such that 


eSSUp^gQ sup [\Zt^n+l+a + \Pt\-{n+a') 

ts[0,T] ^ 


^ C'U ||po||-(n+a') + eSSUp^gQ 


{\\f^ 

I \ 


\gf\\n+l+a+ sup [\\ff\\n+a + \\bt\\-{n+a'-l) 
te[0,T] 


( 122 ) 


For another initial condition Pq in and another set of inputs {(ffl', ff')t^[Q^T]i 9^') os 

in points 1-6 in the introduction of Subsection \4.4[ consider an adapted solution (p), .S()te[o,T] of 
the system (I108p - (ll09p . with paths in the space C‘^([0, T], (C”'''^(T‘^))') x C°([0, T],C’^+^+^(T‘^)) 
for the same /3 e {a', a) as above, such that. 


essup^gj^ sup 
te[o,r] 


(ll^Ili 


1+1+/3 + \\Pt\\-{n+P) I < 00 


Then, we can find a constant C, only depending upon C, T, d, a and a' and on 
l|Po||-(n+a') + l|Poll-(n+a') + OSSUp^gQ [|| 5 ^||,i+i+q, + ||„+i+q,] 


essup^gj^ sup 
te[0,T] 


fUll I II f0/|| I II It I II^UCI 

Jt lln+a “T lilt lin+a ”r ITt II — (n+a' —1) ITi II—(n+a' —1) 


,0'll 


82 




























such that 


essup^gQ sup 
te[0,T] 


Zt - z[ 


||2 

Iln+l+Q 


~pt - p't 


2 

— (n+a^) 




Po - p'o 


2 

— (n+o') 


+ essup^gf^ 


( WPt ~ drlln+l+a + sup 

II tO i)0'||2 1 II fO f0'jl2 

Tt 1 —(n+a' —1) \\Jt Jt lln+a 

V te[0,T] 



essup^gj^ sup 
te[0,T]^ 


\Vt-V' 


|2 

I n+a 




l|ri-r;||2 


( 123 ) 


Proof. We start with the proof of (jl22p . 

First step. The proof relies on the same trick as that used in the third step of the proof of 
Theorem 14.31 In the statement of Proposition 14.181 the initial conditions po and Pq are assumed 
to be deterministic. It can be checked that the same argument holds when both are random 
and the expectation is replaced by a conditional expectation given the initial condition. More 
generally, given some time t e [0, T], we may see the pair (/Os, 5<i)sEp,T] the solution of the 
system (llOSp with the boundary condition (Iin9|] . but on the interval [t,T] instead of [0,T]. In 
particular, when pg = 0, (b^', f^', g^') = (0,0,0), V = V',T = V' (in which case {p',z') = (0,0)), 
we get 


E 


sup 

Lss[t,T] 


Zs lln+l+a 


II — (n+a') ) 


^ c 


I Pt- II —(n+o') 


E[e^\jzt] , 


where we have let 


© 


sup ||5°||_(„+„/_i) + sup ||/°||n+a + ||5Tl|n+l+Q- 
ss[t,T] s£[t,r] 


Second step. We now prove the estimate on p. From the first step, we deduce that 


II ? 11^ < F' 

IIIln+l + Q: ^ 




^ C 


||Pt||^(n+a') TeSSUp^gf^©^ 


(124) 


The above inequality holds true for any te [0,T], P almost surely. By continuity of both sides, 
we can exchange the ‘P almost sure’ and the ‘for all t e [0, T]’. Now we can use the same duality 
trick as in the proof of Proposition 14.181 With the same notations as in (IllSp and (11161) . we 
have 


Vte[0,T], Iluitjl 

n+a' ^ C'llCII n+a'- 

Then, we have 

{(iCt , ^ ^^OiPO^x^ [X^y IIII n+a' —1 (II II—(n+a' —1) T ||n+a) 

^ C* II ^||n+a' ^IIPoII — (n+a') T j^llPsll-(n+a') T ds'j , 

from which we deduce, by Gronwall’s lemma, that 

l|Pr!!-(n+a') ^ C'd l|Po||-(n+a') + SUp eSSUp^gj^© ) , 

^ te[0,T] '' 


83 






























and thus 

eSSUp^gf^ sup ||pfl|-(n+a') ^ C''(ll^oll-(n+a') 

te[0,T] ^ ' 

By ()124p and (11251) . we easily get a bound for z. 

Last step. It then remains to prove (I123p . By means of the first step, we have bounds for 

eSSUp^^gQ sup ( pt||n+l+a + Pjljn+l+a + \\pt\\-(n+a') + \\Pt\\-{n+a')) ■ 
tE[0,T] ^ '' 

Plugging the bound into the stability estimate in Proposition 14.18l we may proceed in the same 
way as in the two first steps in order to complete the proof. □ 


4.4.4 Proof of Theorem 14.151 


We now complete the proof of Theorem 14.151 It suffices to prove 

Proposition 4.20. There is an £ > 0 such that if, for some e [0,1) and (3 e (a', a), for 
any initial condition po in (T'^))' and any input ((&?)te[o,r]) (/f )te[o,T]) 9 t) 

troduction of Subsection [23 the system (I1U8P " (I109P has a unique solution (pt, Zt)te[o,T] 
paths in C‘^([0, T], (C"^+^(T‘^))') x C‘^([0, T], such t/iat essup^ sup^gj-g •p](||pt||_(„+p)+ 

\\zt\\n+i+p) < 00, {pt,Zt)te[o,T] also satisfying essup^supt^[Q^T-^{\\pt\\_(^n+^,) + \\zt\\n+i+a) < oo, then 
unique solvability also holds with d replaced by '& + s, for the same class of initial conditions and 
of inputs and in the same space; moreover, solutions also lie (almost surely) in a bounded subset 
of the space L“([0,T], (C("+"')(T'^))0 x L*([0,r],C”+i+“(T"')). 

Proof. Given i) 6 [0,1) in the statement, e > 0, an initial condition pg 6 (T'^))', an input 

i{bt)tG[o,T]i {ft)te[o,T]i Pt) satisfying the prescription described in the introduction of Subsec¬ 
tion 144] and an adapted process {pt, zt)te[o,T] {p having pg as initial condition) with paths in 
CO([0,r],(C^+^(T'^))') X CO([0,r],C”+i+^(T'^)) such that 

essup^gf^ sup (||pt|l_(„+„/) -t \\zt\\n+l+a) < OO, (126) 

te[o,r] 


we call 4 >£(p, z) the pair (p[, .z^ossissr solving the system (jlOSp with respect to the initial condition 
pg and to the input: 

if = emtVtDzt + if 

f?' = + 

gf = ^^(•>™'t)(pt) +9t- 

By assumption, it satisfies 

eSSUp^gj^ sup (||p[|| _(„+„/) -h p[||n+l+a) < 00, 
te[0,T] 

By Corollary 14.191 


eSSUp^gQ sup (fz[\n+l+a + ||p[||-(n+o')) 
ts[0,T] 


||po||-(n+o') + ceessup^gf 2 sup (||ptl|-(n+a') + Pt||n+l+o) 

ts[o,r] 

-t essup^gfj sup {\\if\_^n+a'-l) + ll/°l|n+a) + |IPrlln+l+c 
'-ts[0,T] 
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where c is a constant, which only depends on the constant C appearing in points 1-6 in intro¬ 
duction of Subsection 14.41 and on the bounds appearing in (HFl(n)) and (HGl(n-(-l)). 

In particular, if 


eSSUp^gQ sup {\\zt\\n+l+a + \\M-{n+a')) (127) 

te[0,T] 

^ 2C''( ||po||-(n+o') (ll^t ll-(n.+a'-l) + !l/°l|n+«) ), 

\ L te[0,T] V 


and 2C'ce ^ 1, then 

eSSUp^^gQ sup [llz'-f-Wn+l+a + \\Pt\\-{n+a')) 
te[0,T] 

^ 2C''( ||po||-(n+o') Il5rlln+l+a + SUp (||6 °||_(„+q,/_i) -|- j|/°|ln+o) ), 

\ L te[0,T] V 

SO that the set of pairs {p, z) that satisfy (|126p and (|127p is stable by $£ for s small enough. 

Now, given two pairs {pj, zl)te[o,T] and {Pt, zf)te[o,T] satisfying (fT^, we let [pj', 4 0te[o,T] 
and {pf, zf)te[o,T] be their respective images by <I>£. We deduce from Proposition 14.181 that 


E 


sup 

Us[0,T] 


\Zt 


z2l\\2 

lln+l+a 


11-1/ -2/II2 

sup IIPt - Pt Il_(„+a/) 

is[0,T] 




C's^E 


sup 

Lts[0,T] 


ll?l - ?2||2 

W'^t ■^t lln+l+a 


sup WpI - > 

is[0,T] '1 


for a possibly new value of the constant C, but still independent of "d and e. Therefore, for C'e^ < 
1 and 2C'ce ^ 1, $£ is a contraction on the set of adapted processes {pt, zt)te[o,T] having paths 
in C°([0,T], (C”'+^(T‘^))') x C°([0, T],and satisfying (I127p (and thus (I126p as well), 
which forms a closed subset of the Banach space C°([0, T], (C^'+^PT'^))') x C‘^([0,T],C^'+^PT'^)). 
By Picard fixed point theorem, we deduce that <I>£ has a unique fixed point satisfying (112711 . 
The fixed point solves (llOSp - fllOhp . with ?? replaced by i? -I- e. 

Consider now another solution to (I108l) - (jl09ll with i} replaced by i? -I- e, with paths in a 
bounded subset of C°([0, T], (C’^+^pT^))') x C°([0, T],C”''‘^'''^(T‘^)). By Proposition 14. 181 it must 
coincide with the solution we just constructed. □ 
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5 The second-order master equation 

Taking benefit of the analysis performed in the previous section on the unique solvability of the 
MFG system, we are now ready to define and investigate the solution of the master equation. 
The principle is the same as in the first-order case: the forward component of the MFG system 
has to be seen as the characteristics of the master equation. The regularity of the solution of 
the master equation is then investigated through the tangent process that solves the linearized 
MFG system. 

As in the previous section, the level of common noise (5 is set to 1 throughout this section. 
This is without loss of generality and this makes the notation a little bit simpler. 

5.1 Construction of the Solution 

Assumption. Throughout the paragraph, we assume that the assumption of Theorem 14.31 is in 
force, with a e (0,1). 

For any initial distribution mo £ P(T^), the system (1781) admits a unique solution so that, 
following the analysis performed in the deterministic setting, we may let 

U{0,x,mo) = uo{x), xeT'^. 

The initialization is here performed at time 0, but, of course, there is no difficulty in replacing 
0 by any arbitrary time to e [0,r], in which case the system (I78p rewrites 

dtrht = {Amt + div(mtDpHtoA'^^^t))}dt, 

r ~ ~ 1 ~ (128) 

dm = {-Aut + Dut) - Ft^^t{-,'rnto,t)]dt + dMt, 

with the initial condition rht^, = mo and the terminal boundary condition ut = GtQ{-,mtQ^T), 
under the prescription that 

mto^t = {id + V2{Wt - Wto))‘!\rht, 

= F{x + V2{Wt - IFtJ,//), 

Gt,{x,p) = G{x + V2{WT-Wt,),ti), 

Ht,,t{x,p) = H{x + V2{Wt -Wt,),p), 

It is then possible to let 

U{to,x,mo) = utoix) 

We shall often use the following important fact: 

Lemma 5.1. Given an initial condition {to,mo) e [0, T] x V{T'^), denote by {mt,ut)te[tQ,T] Ide 
solution of (I128P with the prescription (11291) and with mt^ = mo as initial condition. Call 
mtQ^t the image of rht by the random mapping 3 x x -I- -\/2(IW — Wt^) that is mt^^t = 
[id + \^{Wt — WtQ)]‘^mt. Then, for any to + h e [to, T], P almost surely, 

utQ+h{x) = U{to + h,x + V2{Wto+h - Wto),mta^to+h), x e T'^. 

Proof. Given to and h as above, we let 

fht = [id + V2(lTto+/i - IFto)]jlmt, ut{x) = ut[x - -s/^iWto+h - t 6 [to -t h,T{, x 6 T'^. 


(129) 

xeT'^, peM"*, /reP(T'^). 

xe T^. 
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We claim that {fht, ut)te[to+h,T] is a solution of (jl28|) - (jl29p . with Iq replaced by to + ^ and with 
'>T^to,to+h as initial condition. 

The proof is as follows. We start with a preliminary remark. For t 6 [to + h,T], 

[id + V2{Wt - ^0+= [id + V2{Wt - Wt,)]^mt = mt,,f (130) 

We now prove that the pair {mt,ut)tQ+h^t^T solves the forward equation in (11281) . To this 
end, denote by {Xto,t)te[to,T] fh® solution of the SDE 

dXtQ^t = —DpHto^t{Xto,t,Dut{XtQ^t))dt + V2dBt, t e [to,T], 

the initial condition Xtg^to having mo as distribution. (Notice that the equation is well-posed as 
Du is known to be Lipschitz in space.) Then, the process (W = XtQ^t+V^{WtQ+h—WtQ))te[to+h,T] 
has {rht = {id + \/2{WtQ+h — ^to))‘A'drit)te[to+h,T] as marginal conditional distributions (given 
{Wt)te[o,T])- The process satisfies the SDE 

dXt = -DpHt,,t{Xt - V2{Wt,+h - Wt,),Dut{Xt - V2{Wt,+h - lTiJ))dt + V2dBt 

= -DpHtQ+h,t{Xt, Dut{Xt)^dt + V2dBt, 

which is enough to check that the forward equation holds true, with ffit^+h = ^to,to+h as initial 
condition, see (jl30ll . 

We now have 

dm = [-Xut + - Ftg,t{-,rnto,t)}{- - V2{Wto+h - Wto))]dt 

+ dMt{--V2{Wt,+h-Wt,)) 

= [-Aut + {Hto+h,t{'^d:’iit) - FtQ+h,t{-,'mto,t)}]dt + dMt{- - ^/2{WtQ+h - bEtJ). 

Now, (|130p says that rritQ^t reads [id + V2{Wt — WtQ+h)]‘^iht, where {int)to+h^t^T is the current 
forward component. This matches exactly the prescription on the backward equation in (I128p 
and (|129p . 

If mtQ^to+h was deterministic, we would have, by definition of U, U{to + h, x,mto,tQ+h) = 
utQ+h{x), X e and thus, by definition of uto+h, 

uto+h{x) = U{to + h,x + V2{Wto+h-Wto),mto,to+h), x e T'^. (131) 

Although the result is indeed correct, the argument is false as rritQ^to+h is random. 

To prove (jl31l) . we proceed as follows. By compactness of P(T'^), we can find, for any e, 
a family of N disjoint Borel subsets A^,... ,A^ cz P(T'^), each of them being of diameter less 
than e, that covers 'P{T^). 

Eor each i e {1,..., N}, we may find m e Ah We then denote by (m[, the 

solution of pi28p - pi29p . with tg replaced hy tg + h and with / 2 i as initial condition. We let 

N 

rht = J]mllj^i{mto,to+h), 

2 = 1 
N 

2=1 

Since the events {nitQ^to+h £ for each i = 1,... ,N, are independent of the Brownian 
motion {Wt - Wto+h)telto+h,T], the process {mt,ut)te[to+h,T] is a solution of (I128|) - (ll29p . with 
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to replaced by io + ^ and with rhtQ^to+h as initial condition. With an obvious generalization of 
Theorem 14.31 to cases when the initial conditions are random, we deduce that 

N 

HW'^to+h -'^to+hWl+a] ^ CE[dl{mto+h,mt^+h)] = C Y^E[lji^i{mto,to+h)dl{mto^to+h, fJ-")]- 

i=l 

Obviously, the right-hand side is less than Ce‘^. The trick is then to say that reads 

U{to + h, ■, Hi). Therefore, 


N 


2 E[l^* {mto,to+h) \\uto+H -U{to + h, ■,H')\\l+a] ^ 


2=1 


Using the Lipschitz property of U{to + h, •, •) in the measure argument (see Theorem 14.3p . we 
deduce that 


E 




n+a 


^ Ce\ 


Letting e tend to 0, we complete the proof. 


□ 


Corollary 5.2. For any a' e (0,a), we can find a constant C such that, for any to e [0,T], 
he [0,T — to]? and mo £ 

\\Uito + h, ., mo) - Uito, •, mo)lL+„, ^ 

Proof. Using the backward equation in ()128p . we have that 


Utoi) = E 


fto+h 


PhntQ+hi.') I Ps—to{HtQ,t{' I Dns) FtQ^fi-,mtQ^g))ds 


'to 


Therefore, 


utoi-) - E{uto+h{-)) = E 


rto+h 

{^Ph tdjutQ+hi') / Ps—toi^Hto,fi '1 Dhs) Fto,t{'^ na^Q^s)^ ds 

Jto 


So that 




{Ph *'^)^4o + /i|L+q' 

rtQ + h ^ ^ 

+ C {s -to)~^^‘^\\HtQ,t{-,Dus) - Fto,t{-^''TT-to,s)\ 

Jto 


n+a' —1 


ds. 


It is well checked that 


E 


(T/i i(i)'UjQ^/jj 


n-\-a' 


^ C7/j(«-«')/2e 

^ C / l {«-«')/2 


r ^0+^llri-t-CK 


the last line following from Lemma 14.91 
Now, by Lemma EH 


E[uto+h] = E[U{to + h,- + V2{Wto+h - Wto),mto,to+^)] 

= ]E[f7(to + h,- + V2{Wto+h - WtQ),mto,to+h) “ ^{^0 + h,-,mo)] + U{to + h,-,mo), 
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where, by Theorem I4.81 it holds that 


E[U{to + h,- + V2{WtQ+h - Wto),mto,to+h) -U{to + h,-,mo)] 


n-\-a' 


< 


CE[\di{mto,to+h,mo)\] 


E 


\\U{to + h,- + V2{Wto+h - Wto),mo) -U{to + h,-,mo) 


n+a' 


which is less than "')/2_ 


□ 


5.2 First-order Differentiability 

Assumption. Throughout the paragraph, we assume that F, G and H satisfy (1261) and (I27p 
in Subsection 12.31 and that, for some integer re ^ 2 and some a 6 (0,1), (HFl(n)) and 
(HGl(n-|-l)) hold true. 

The purpose is here to follow Subsection 13.41 in order to establish the differentiability of U 
with respect to the argument mo. The analysis is performed at to fixed, so that, without any 
loss of generality, to can be chosen as to = 0. 

The initial distribution mo e V{T'^) being given, we call {m,u) the solution of the system 
(I75|l with mo as initial distribution. Following (IS51) . the strategy is to investigate the linearized 
system (of the same type as ()107p l: 

6F+ 

dtzt = {-Azi + (DpHt{-,Dut),Dzt} - —{■,mt){pt)}dt + dMt, 
dtpt - ^pt - div{ptDpHt{-,Dut)) - div{rhtDppHt{-, Dut)Dzt) = 0, 
with a boundary condition of the form 


= ^(•,mT)(pT). 

As explained later on, the initial condition of the forward equation will be chosen in an ap¬ 
propriate way. In that framework, we shall repeatedly apply the results from Subsection 14.41 
with 

Vti') = DpHt{-, Dut), Ft = DppHt{-, Dut), te[0,r], (133) 

which motivates the following lemma: 

Lemma 5.3. There exists a constant C such that, for any initial condition mo e 'P(T‘^), the 
processes {Vt)te[o T] o.nd (rt)jg[o T] (I133p satisfy points 2 and 4 in the introduction of Subsection 

B 

Proof. By Theorem 14.31 and Lemma 14.91 we can find a constant C such that any solution 
imt,ut)te[o,T] to (ITHP satisfies, independently of the initial condition mo, 

essup^gQ sup ||rei||„+i+o ^ C. 

te[0,T] 


In particular, allowing the constant C to increase from line to line, it must hold that 


essup^gj^ sup \\DpHt{-,Dut)\\ s^C. 

te[0,T] 
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Moreover, implementing the local coercivity condition (I26jl . we deduce that (assuming C ^ 1), 
with probability 1, for all t e [0,T], 

lirtlli ^ C ; Vx e C-^d ^ Ftix) ^ Ch, 

which completes the proof. □ 

Given y e and a d-tuple i e {0,..., n}'^ such that \l\ = XlILi ^ T'^ 9 x i—> 

(x, mo, y) e M the value at time 0 of the backward component of the solution to (jl32ll when the 
forward component is initialized with the distribution (—Clearly, D^5y e (C”'+“ (T'^))' 
for any a' e (0,1), so that, by Theorem 14.151 mo,y) belongs to C"’+"(T'^). (Recall that, 

for a test function (p e C"'(T'^), {D^6y)ip = (—1)1^1 ^ p{y).) Similarly, we may denote by 

Vi -yf 

the solution of (I182jl with = (—1)1^1 as initial condition. For simplicity, 
we omit rriQ in the notation. We then have 

= v^^\-,mo,y). (134) 


We then claim 


Lemma 5.4. Let mo e ViT^). Then, with the same notation as above, we have, for any 
a' 6 (0, a) and any d-tuple i e {0,... , n}'^ such that |Z| ^ n, 


lim essup^gfi sup + 

'<i3h-^0 tern XT V {n+a ) 


^t,y+h ~e,y 


n+l+Q 


Td-ah-^O te[0,T] 

Moreover, for any i e {0,... ,n — l}*^ with |£| ^ n — 1 and any i e {1,...,d}, 

limK\{o}3h->oessup^gt2SuPte[o,T](||Kdt’^’^^''' - Pt^) “ !_(„+„') 


= 0 . 


(135) 


, II 1 / -t.y+hei ~t,V\ 

+ \\h{zt 


■Mei,y\\ 


lln+l+a 


= 0 , 


where e* denotes the vector of the canonical basis and£-\-ei is understood as {£-\-ei)j = ij + df, 
for j e {1,... ,d}, 5l denoting the Kronecker symbol. 

In particular, the function 9 (x,y) n®(x,mo,y) isn-times differentiable with respect 

to y and, for any I 6 {0, ..., 71 }“^ with l^l ^ n, the derivative DyV^^\-,mo,y) : 3 x 

DyV^^\x,mQ,y) belongs to C’^''‘^''‘“(T‘^) and writes 


DyV^°\x,mo,y) = v^^\x,mo,y), {x,y) e T'^. 


Moreover, 

sup sup ||L>yn®(-,mo,y)||n+i+a < zo. 
moGViT^) j/gT^ 

Proof. By Corollary 14.191 (with a = a and of = of for some of e (0, a)), we can find a constant 
C such that, for all y e T'^, for all mo £ V(T^) and all £ 6 {0,..., n}'^ with \£\ ^ n, 

eSSUp^gf^ sup {fzf'^Wn+l+a + \\Pt''^\\-{n+c,’)) ^ C. 
te[0,T] 


In particular, 

||n^^^(-,mo,y)||n+i+a ^ C. 
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Now, we make use of Proposition 14.18l We know that, for any ol e (0,1), 




Therefore, for a' < a. Corollary 14.191 (with a' = a' < a and a = a) gives (I135p . This yields 


lim|u(^)(-,mo,y + h) - {■,mo,y)\ 


h^O 


n+l+a 


proving that the mapping T'^ 3 y v^^\-,mQ,y) e (^"+i+“(']r'^^ ig continuous. 
Similarly, for It"! ^ n — 1 and i 6 {1,... , d}, 


1 


lim \\-(D^Sy+he- - D^6y) + ^ n = 0, 

R\{0}3h^o'' 3 /|l-(n+a') 


or equivalently, 


As a byproduct, we get 

lim \[v^^\-,mo,y+ hei)-v^^\-,mo,y)\-v^^"^^^\-,mo,y) = 0, 

hj Tl+l + CK 

which proves, by induction, that 

Dlv^^\x,mo,y) = v^^\x,mo,y), x,yeT'^. 


This completes the proof. □ 

Now, we prove 

Lemma 5.5. Given a finite signed measure y on T'^, the solution z to p32p with y as initial 
condition reads, when taken at time 0, 

zo : 3 X zo(x) = / v^^\x,mo,y)dy{y). 


Proof. By compactness of the torus, we can find, for a given e > 0, a covering of 

made of disjoint Borel subsets, such that each Ui, i = 1,... ,N, has a diameter less than e. 
Choosing, for each i e {1,... , A"}, yi e Ui, we then let 

N 

y^ = Yih{Ui)8yi. 

i=l 


Then, for any tp e C^(T'^), with ||(/jj|i ^ 1, we have 


/ p{y)d{y-y^){y) = Y! / {p{y) - p{yi))dy{y) 

dTrf iUiJUi 


^ C'||/i||e, 


where we have denoted by \\y\\ the total mass of y. 
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Therefore, by Proposition 14.181 


zo-Y, v^^\-,mo,yi)dn{y) 


^ C'll/u||e, 


n+l+Q; 


where we have used the fact that, by linearity, the value at time 0 of the backward component 
of the solution to (jl32p . when the forward component is initialized with reads 


N 


i=l 


N 

S-'f'. 


Y^y{UiY^'>{-,mo,yi) = ^ / v^°\-,mo,yi)dy{y). 


By smoothness of in y, we easily deduce that 


zo- v^°\-,mo,y)dy{y) 


Ijd 


^ C'||yl|e. 


n+l+a 


The result follows by letting e tend to 0. 


□ 


On the model of Corollary 13.91 we now claim 

Proposition 5.6. Given two initial conditions mo,mQ e 'P(T‘^), we denote by {iTit,'Ut)te[o,T] 
and i'dit,Ut)te[o,T] ^de respective solutions of (1781) with ttiq and mg as initial conditions and by 
{pt, zt)te[o^T] the solution of (I132|) with mg — mo as initial condition, so that we can let 

5pt = m[-mt- pt, 6zt = u[ - ut - zt, te [0, T], 


Then, for any a' e (0, a), we can find a constant C, independent o/mo and mg, such that 

essup^gQ sup {\\6pt\\-i^n+a') + \\dzt\\n+i+a) ^Cdj{mo,mQ). 

OsSisST 


In particular, 


C/(0,-,mo) - C/(0,-,mo) 


v^°\x,mo,y)d{mQ - mo){y) ^ C'di(mo,mo), 


n+l+a 


and, thus, for any x e T^, the mapping V{T'^) 3 m >—>■ U{0,x,m) is differentiable with respect to 
m and the derivative reads, for any m e Viff^), 

su 

— {0,x,m,y) = v^°'>{x,m,y), y e 

The normalization condition holds; 

/ v^^\x,m,y)dm{y) = 0. 

JT‘i 

The proof is the same as in the deterministic case (see Remark 13.lOp . 

Proof. We have 


A F, _ _ 

dt{dzt) = {-A{Szt) + (DpHt{-,Dut),D{Szt)} -—{■,mt){6pt) + ft}dt + dMt, 
dt{6pt) - A{5pt) - diY[{5pt)DpHt{-,Dut)\ - d\\[mtD‘^pHt{-,Dut){D5ff) + It] = 0, 
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with a boundary condition of the form 


6G 


SZT = j^{-,mT){6pT) +gT, 


where 


bt = m[{DpHt{-,Du[) - DpHt{-, Dut)) - fhtD^pHti-, Dut){Du[ - Dut) 
ft = Ht{;Du[)-Ht{;Dut)-(DpHt{;Dut),Du't-Dut} 

- {m't - mj)), 

6G 

gr = G'(-, m^) - G{-,mT) - {m'rp - ttit) ■ 


Now, 


bt = {m[ - rht) {DpHt{-, Du[) - DpHt{-,Dut)) 


+ m 


DppHt{-,XDu[ + (1 - \)Dut) - DppHt{-,Dut) {Du[ - Dut)d\ 

= {m't - rht) j DppHt{-, XDu't + (1 - X)Dut) {Du't - Dut)dX 

XDlppHt{-,XsDu't + (1 - A + A(1 - s))Dut){Du't - Dut)®‘^dXds, 


1 /■! 


+ rnt 


f 



0 ^0 


ft= {DpHt{-, XDu't + {1 - X)Dut) - DpHt{-,Dut), Du't-Dut)dX 


1 


Am^ + (1 - X)mt) - (K “ mt)dX 


n X(^DppHt{-, XsDu't + (1 — A + A(1 — s))Dut) {Du't ~ D'dt ), Du't ~ DuffdXds 


^ “ ^t)dX, 


gT = 


6G 


6G 


6m ^ “ rnT)dX. 


By Lemma 14.91 we have a universal bound for 


essup^gj^ sup 
te[0,T] 


n+l+a “f llf^t ||n+l+a) • 


We deduce that 






C(di{m't,mt)\\u't - uth + \\u't - 


II? , 


ft\\n+a ^ c(\\u't - utlln+i+„ + 
Wgrln+i+a ^ C'd^(m^,?fir). 
Therefore, by Theorem 14.31 we deduce that 


essup^gQ sup ||6t||_i + essup^gQ sup ||/tj|n+a + essup^gs^H^TlIn+i+a ^ ddf (mj,,mo). 
Ositssr O^t^T 
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By Corollary 14.191 we get the first of the two inequalities in the statement. We deduce that 


||t/(0,-,mo) - 17(0,-,171,0) - ^ C'df(mo,mo). 

By Lemma 15.51 we complete the proof. □ 

Proposition 5.7. For any a' e (0,a), we can find a constant C such that, for any mo,mQ e 
P(T'^), any y,y' e and any index f e {0 ,... ,n}'^ with \i\ ^ n, denoting by {rht,ut)te[o,T] 

?i()fg[o,T] the respective solutions of ([75I1 . and then (pt, 5t)7e[o,T] o,nd {p'f,z't)te[o,T] the corre¬ 
sponding solutions of (11321) when driven by two initial conditions and , 

it holds that 


essup^gf^ 


sup 

.ts[0,T] 


Zt - Zi-Wn+l+a + sup 
ts[o,r] 


Pt Pt\\ — {n+a') 


^ c(di{mo,mo) + \y - pT')- 


In particular, 
Vy,y'eT‘^, 


L>y^(0,-,777,0, y) - Dl^{0,-,mo,y') 
^dm ^dm 


n+l+a 


^ c(di{mo,mQ) + \y - y' 


Proof. Given two initial conditions mo and tttq, we call irht,ut)te[o,T] and {m[,u^)te[o,T] the 
respective solutions of (178)) . With imt,ut)te[o,T] and (d7,(,'uO^gp^r]) associate the solutions 
{pti zt)te[o,T] and (p(, 7()tg[o^r] of (I132j) when driven by two initial conditions {—l)^^^D^6y and 
(—1)1^1Since \I\ ^ n, we have 

In order to prove the first estimate, we can apply Corollary 14.191 with 


Vt = DpH{-,Dut), Vf = DpHi;Du[), 

Tt = DlpHti;Dut), T[ = DlpHti;Du[), 

so that, following the proof of Proposition 15.61 


ht h) ||n+a “t llPt h^llo ^ Cllfti uf\n+l+a- 


Now, the first estimate in the statement follows from the combination of Theorem 14.31 and 
Corollary 14.191 

The second estimate is a straightforward consequence of the first one. □ 


Proposition 5.8. Propositions \5.6\ and easily extend to any initial time to e [0, T]. Then, 
for any a' e (0, a), any to e [0, T] and mo e P(T'^) 


lim sup 

^"^0£E{0,...,np,|£|^n 


, .6U 


Dy^—ito + h,-,mo,-) - D —{to,-,mo,-) 
^ dm y 


'5m 


n+l+a'jQ;' 


= 0 . 


Proof. Given two probability measures m,m' e P(T^), we know from Proposition 15.61 that, for 
any t e [0,r], 

U(t,-,m') — U(t,-,m) = j ^— {t,-,m,y)d{m'— m'){y) + 0{dL\{m,m')), (136) 
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the equality holding true in and the Landan notation O(-) being uniform in to and 

m (the constant C in the statement of Proposition 15.61 being explicitly quantified by means of 
Proposition 14.181 related to the stability of solutions to the linear equation). 

By Proposition 15.71 the set of functions 9 {x,y) i—> (<5C//(im)(t, x, m, y))^g|-o^ 7 ’] is rel¬ 
atively compact in ^"■+i+“'(']r'^) x for any a' e (0 ,q;). Any limit d> : —> M 

obtained by letting t tend to to in (I136jl must satisfy (use Corollary 15.21 to pass to the limit in 
the left-hand side): 

U{to,-,m) -U{to,-,m) = [ - rn){y) + 0{dl{rn,rn')), 

the equality holding true in C°(T‘^). This proves that, for any x e T'^, 


6U 


Ijd 


— {to,x,m,y)d{m-m){y) = j_ ^[x,y)d{m' - m)[y). 


fd 


Choosing m! as the solution at time h of the Fokker-Planck equation 


dtmt = -div(6mi), t ^ 0, 


for a smooth field h and with mo = m as initial condition, and then letting h tend to 0, we 
dednce that 

/ DmU{to,x,m,y) ■ b{y)dm{y) = Dy<^{x,y) ■ b{y)dm{y). 

Jld Jjd 

When m has full support, this proves that 

su 

^{x,y) =—{to,x,m,y) + c{x), x,yeT^. 

Since both sides have a zero integral in y with respect to m, c(x) mnst be zero. 

When the snpport of m does not cover T'^, we can approximate m by a sequence {mn)n^i 
of measures with full supports. By Proposition 15.7( we know that, for any a' 6 (0, a), 


SO that, in C^+i+“'(T'^) x C"'(T'^ 


.. , X SU, . 

hm sup -j—(t, •,m„, •) - — (t, •,m, •) 


Sm 


r^+l+Q:^C^:' 


= 0 , 


hm |^(t,-,m,.) = hm lim |^(t,-,m.,.) = (to,m, •) • 
t^to dm n->cot^to dm dm 

We easily complete the proof when |£| = 0. Since the set of functions ([T'^]^ 9 {x,y) 
{Dy6U/6m){t,x,m,y))te[o,T] is relatively compact in C'^+^+o^'{Y'^) x C"'(T‘^), any limit as t 
tends to Iq must coincide with the derivative of index ^ in y of the limit of 9 (x, y) i—> 

[6U/6m]{t, x, m, y) as t tends to to- D 


5.3 Second-order Differentiability 

Assumption. Thronghont the paragraph, we assume that F, G and H satisfy (j26h and (I27p 
in Snbsection 12.31 and that, for some integer n ^ 2 and some a e (0,1), (HF2(n)) and 
(HG2(n-|-l)) hold true. 


95 










In order to complete the analysis of the master equation, we need to investigate the second- 
order differentiability in the direction of the measure, on the same model as for the first-order 
derivatives. 

As for the first order, the idea is to write the second-order derivative of U in the direction 
m as the initial value of the backward component of a linearized system of the type (jl07|l . 
which is referred next to as the second-order linearized system. Basically, the second-order 
linearized system is obtained by differentiating one step more the first-order linearized system 
(I132p . Recalling that ()132p has the form 


(5K 

dtzt = {-Azt -f (DpHt{-,Dut),Dzt} - -^{■,mt){pt)}dt dMt, 
dtpt - Apt - div[ptDpHt{-,Dut)) - div{rhtDut)Dzt) = 0 , 


(137) 


with the boundary condition 

ZT = ^{■,'mT)ipT), 
dm 

the procedure is to differentiate the pair {pt, zt)te[o,T] with respect to the initial condition mo of 
imt,Ut)te[o,T]^ the initial condition of {pt, Zt)te[o,T] being kept frozen. 

Above, {rht,ut)ositsiT is indeed chosen as the solution of the system (TTHP . for a given initial 
distribution mo £ V{T'^), and {pt, zt)te[o,T] the solution of the system (jl37l) with an initial 
condition po e , for some a' < a. Implicitly, the initial condition pQ is understood 

as some mg — mo for another mg 6 P(T'^), in which case we know from Proposition 15.61 that 
[pt-,h)te[Q,T] reads as the derivative, at e = 0, of the solution to (ITSP when initialized with 
the measure mo -I- e(mQ — mo). However, following the strategy used in the analysis of the 
first-order derivatives of t/, it is much more convenient, in order to investigate the second-order 
derivatives of U, to distinguish the initial condition of {pt)te[o,T] from the direction mg — mo 
used to differentiate the system ([THI) . This says that, in (|137l) . we should allow (pt, zt)te[o,T] to 
be driven by an arbitrary initial condition pQ e (C”''““^(T‘^))'. 

Now, when (I137h is driven by an arbitrary initial condition pQ and mo is perturbed in the 
direction mg — mo for another mg 6 P(T'^) (that is mo is changed into mo -I- e(mg — mo) for some 
small e), the system obtained by differentiating (I137p (at e = 0) takes the form 


= [-Azf^ + (DpHti-,Dut),Dzi‘^^} - 

-h {DppHt{-,Dut),Dzt ® DdmUt) - dmmt)jdt dMt, 

dtpf^ - Apf^ - diY{pf^DpHt{-,Dut)^ - diY{rntDlpHt{-,Dut)Dzf'’^ 

- div{ptDlpHfi-, Dut)DdmU?j - di\{dmrhtDlpHfi-, Dut)Dzt^ 

- div(mtDpppHfi-, Dut)Dzt 0 DdmU^j = 0 , 


with a terminal boundary condition of the form 

~(2) 6G , , / (2)x S^G , .. . , 

dm dm^ 

where we have denoted by idm'dit,dmUt)te[o,T] the derivative of imt,ut)te[o,T] when the initial 
condition is differentiated in the direction m'^ — mo at point mo, for another mg 6 P(T'^). In 
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(I138p . the pair {pi ,zl jtg[o,T] is then understood as the derivative of the solution {pt, zt)te[o,T] 
to (fWll . 

Now, using the same philosophy as in the analysis of the first-order derivatives, we can choose 
freely the initial condition po- Generally speaking, we will choose po = for some 

multi-index I e {0,... ,n — 1}'^ with \l\ ^ n — 1 and some y e Since po is expected to be 

( 2 ) 

insensitive to any perturbation that could apply to niQ, it then makes sense to let p^ = 0. As 
said above, the initial condition of is expected to have the form ttiq — mo 

for another probability measure m^ e Anyhow, by the same linearity argument as in the 

analysis of the first-order derivative, we can start with the case when is the derivative 

of a Dirac mass, namely dmfno = for another multi-index k e {0, ...,n — 1}'^, 

and another ( e T^, in which case is another solution to ()137p . but with 

d^mo = as initial condition. Given these initial conditions, we then let 

provided that (|138p has a unique solution. 

In order to check that existence and uniqueness hold true, we may proceed as follows. The 
system pi38p is of the type (|lU7p . with 

Vt = DpHt{-,Dut), Tt = D'^pHti-,Dut), 

= ptD‘^pHti-,Dut)DdmUt + dmrhtD'^pHt{-, Dut)Dzt + mtD^ppHt{-, Dut)Dzt ® Dd^ut, 

6'^ F 

= (plpHt{-,Dut),Dzt®DdraUt')- j^{-,mt){pt,dmmt), 

9t = T^{-,mT){pT,dramT)- 
dm^ 

(139) 

Recall from Theorem 14.31 and Lemma 14.91 on the one hand and from Gorollary 14.191 on the other 
hand that we can find a constant C (the value of which is allowed to increase from line to line), 
independent of mo, y, Ci ^ and k, such that 


eSSUp^gQ sup \\ut\\n+l+a ^ C, 
te[0,T] 

eSSUp^gQ sup (IIZfII-|- ||dmiit||n+l+o T ilPiII —(n+a') T \\dmkblt 

'-te[0,T] 


— (n+a' 


)) 


^ c. 


(140) 


Since |£|, \k\ ^ n — 1, we can apply Gorollary 14.191 with n replaced by n — 1 (notice that n — 1 
satisfies the assumption of 115.2D . so that 


eSSUp^gf^ sup {\\pt\\-{n+a'-l) + \\Smmt 

Hg[ 0 ,T] 




-d) 


^ c. 


(141) 


Therefore, we deduce that 


essup^gQ sup ||6^||_(„+„/_i) ^ C. 
ts[0,T] 


Similarly, 


essup^gf^ sup ||/°||n+a -h essup^gj^ sup ||y°||n+l+a ^ c. 
is[0,T] te[0,T] 
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From Theorem 14.151 we deduce that, with the prescribed initial conditions, (jl38p has a unique 
solution. Moreover, by Corollary 14.191 

essup^gQ sup In+I+Q + essup^gQ sup pp !!_(„+„/) ^ C. (142) 

te[0,T] te[0,T] 

On the model of Lemma 15.41 we claim: 


Lemma 5.9. The function 

3 {x,y,C) ^ u(°’°)(x,mo,i/,C) 


admits continuous crossed derivatives in (y, C); W to the order n — 1 in y and to the order n — 1 
in Q, the derivative 

mo, 2 /, C) : T'^ 3 X mo, y, C), 

for 1^1, \k\ ^ n — 1, belonging to (^"■+1+“(']!’'=*) and writing 

L»yT>^u(°’°px,mo,y,C) = v^^'^\x,mo,y,0, x,y,C^ 

Moreover, for a' e (0,a), there exists a constant C such that, for any multi-indices i,k with 
|£|, PI ^ n — 1, any y, y', (, (' e T'^ and any mo 6 V{T'^), 

||P'’")(-,mo,y,C) -x('’'=)(-,mo,2/',C')|L+i+„ ^ C{\y-yr' + IC-CT')- 

Proof. With the same notations as in Lemma 15.41 we denote by ’^pf’^)te[o,T] solution 
to (fTT^ with as initial condition and by {pt’^, z^’^)te[o,T] the solution to (|132p with 

as initial condition. 

By Proposition 15.71 (applied with both n — 1 and n), we have, for any y,y' e T'^ and any 

C,C'eT^ 


essup^gQ 


ii-fc.C 
sup \\z^ 

J£[0,T] 



n+l+a 


S'J-P II Pt Pt II —(n+o'—1) 

te[ 0 ,r] 


^cic-cr 


essup^gQ 


sup 

_te[0,T] 





n+l+CK 


II It 

sup llPi"" -Pi’Ml-(n+a'-l) 

te[0,T] 


^C\y-y'r 


(143) 


Denote now by (&f ^’^’^)te[o,r] the process (&t)iE[o,T] iu (I139D when {pt, zt)te[o,T] stands for the 
process {Pt^, zl’'^)te[o,T] and {dmrht, dniUt)te[o,T] is replaced by (p^‘’P^‘’)^e[o,^] • Define in a 
similar way ift’’^’^’‘")te[o,T] and Then, combining (I143|) with (|14U|) and (|141|) 


essupj 


>\\\9f 


9t iln+l+o: 


sup 


U,k,y',C 


-b: 


Lfc.y.Cll 


iG[0,T] 

^C{\y-y'r' + \z-z’r'). 


I —(n+a' —1) 




n+a 


By Proposition 14.181 we deduce that 


u(^’^i(-,mo,p,C) - u(^’^i(-,mo,y',C') 


< 


n+l+Q: 


C{\y-yr' + \c-cr'), 


(144) 


98 





























which provides the last claim in the statement (the L® bound following from (|142p l. 

Now, by Lemma 15.41 (applied with both n and n — 1), we know that, for |A:| ^ n — 2 and 


1 . r /|| 1 / ~C+he,-,A; ~C,k\ ~C,,k+ej 

hni essup^go sup(|-(p 4 ^ 

Oish—>0 L+cfn V il 


R\{0}3h^0 


te[ 0 ,r] 


1 , Q+hej,k _ (;,k\ _ -Ck+Bji 




lln+l+a 


= 0 , 


where ej denotes the vector of the canonical basis of Therefore, by ()140l) . 


lim essup^gf^ 
IR\{0}3fe^0 


1 


sup f||-(6i 
'-te[0,T]^ IT' 

1 


i,k,y,C+hej r£,k,y,(;\ 'ri,k+ej,y,Ci\ 


_ _ 


II — (n+a'—1) 


II / ^£,/c,2/,C+^ej fe,k,y,C\ f^,k-\-ej,y,0 

+ 11/^ Vi It ) Jt I 


n-\-a 


1 f l,k,y,(:+hej _ ~i,k,yX\ _ ,?/,C || 

h \9 t St ) St L+i+a 


= 0 . 


By Proposition 14.181 


lim 


1 


C + hej) - mo, y, C)) - mo, y, C) 


n+l+Q 


= 0 , 


which proves, by induction, that 

T)^u(^’°)(x,mo,y,C) = v^^'^\x,mQ,y,C), x,y,Ce T'^. 
Similarly, we can prove that 

mo, y, C) = mo, y, C), x,y,C^ 

Together with the continuity property (I144p . we complete the proof. 

We claim that 


□ 


Proposition 5.10. We can find a constant C such that, for any mo, mo £ P(T'^), any y,y',C £ 
any multi-indices i, k with \i\, |A:| ^ n — 1, 


mo, y, C) - v^^’^\-,mo,y, C) ^ Cdfimo, m'o). 

n+l+Q: 

Proof. The proof consists of a new application of Proposition 14.181 Given 

• the solutions imt,ut)te[(},T] and (m(, ii()tg[o,T] to (|78l) with mo = mo and ttiq = m'o as 
respective initial conditions. 


• the solutions {dmfht, dmUt)te[o,T] and (^mW-J, dmii()ts[o,T] to (|137p . with {rht,ut)te[o,T] and 
(m(,'u()jg|-o^'r] as respective input and draPiQ = dmuiiQ = (—1)I^IZ)^(5^ as initial condition, 
for some multi-index k with \k\ ^ n — 1 and for some C £ 

• the solutions {pt, zt)te[o,T] and (p(, 50ts[o,T] to (fTF7|), with {rht,ut)te[o,T] and (m(> «t) 4 s[o,T] 

as respective input and as initial condition, for some multi-index £ with \£\ ^ 

n — 1 and for some y 6 
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• the solutions {p\ ,zl )te[o,T] and {Pt i^t )ts[o,T] to the second-order linearized system 
(I138P with Pt, Zt, drnfht, draUt)te[o,T] and {m[,u[,p[,z[,drnm[,drnu't)te[o,T] as respec¬ 

tive input and with 0 as initial condition. 

Notice from p34p that zq = v^^\-,mo,y) and z'^ = 

With each of pt,h,dmmt,d m ut)te[0,T] and p't, z't,drnrh't,drnu't)teio,T], we can 

associate the same coefficients as in (I139p . labeling with a prime the coefficients associated with 
the input {rh^,u^, p'-i.,z'i.,dm'^t,drnUt)te[o,T]- Combining with (|140p and (|141l) . we obtain: 

||Ci — Vl\n+a + llTt — FjIIo + \\b^ — bt ||_(„+q/_i) -I- Wff — f^'\\n+a + WPt ~ grWn+l+a 
^ U-^^n+l+a 4“ \\zt 'Sjln+l+a 4“ SmUt\\n+l+a 

+ di(rht, TTlj ) -|- \\pt — 4" \\draPht — dm'l^t\\ — (n+a' — l)^ ■ 

By Propositions 14.181 and ISTTl (applied with both n and n — 1), we complete the proof. □ 

On the model of Lemma 15.51 we have 

Lemma 5.11. Given a finite measure p on the solution fg (jl32p . with 0 as initial 
condition, when (mf)o<ct<;T is initialized with mo, {pfiomtiT is initialized with (—1)1^1 for 
\^\ ^ n — 1 and y e T'^, and is initialized with p, reads, when taken at time 0, 

z^Q^ : R"* 9 X z^o\x) = [ v‘'^’^\x,mo,y,C)dp{C)- 

JT^i 


Now, 

Proposition 5.12. We can find a constant C such that, for any multi-index I with \^\ ^ n — 1, 
any mo, m'^ e P(T^) and any y e T'^, 




(■,mo,y) -v'^^\-,mo,y) - / v(^’°^(-,mo,y,C)d(mo - mo)(C) 


< 


/jd 


Cdf(mo, 


m'o) 


n+l+Q; 


Proof. We follow the lines of the proof of Proposition 15.61 Given two initial conditions mo, m'^ e 
P(T^), we consider 

• the solutions {mt,ut)te[o,T] and im't,Ut)te[o,T] to with rho = mo and rhg = m^ as 
respective initial conditions, 

• the solution {dmizit, dmUt)te[o,T] to (|132p . when driven by the input {mt,ut)te[o,T] and by 
the initial condition dmiho = m'^ — mo, 

• the solutions {pt,zt)ts[o,T] and (pj, 5()is[o,T] to (fT37P, with {mt,ut)te[o,T] and {m[,u'f)te[o,T] 

as respective input and as initial condition, for some multi-index £ with \l\ ^ 

n — 1 and for some y e T'^, 

• the solution {p\ ’ ,zl 0ts[o,T] to (|138p with {rht, ut, pt, zt, dmrht, drnUt)te[o,T] as input and 0 
as initial condition. 
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Then, we let 

= P't-Pt- pf ^, =z[-zt- zf ^, t 6 [0, T]. 


We have 

dtibzf^) = {-A( 55 f) + (DpHt{;Dut),D{6zi^^)} - + ft}dt + dMt, 

dt{5pf) - ) - d\w[{5pf'^)DpHt{-,Dut)] - d\Y[mtDl,Ht{-,Dut){D5~zf'^) + k] = 0, 

with a boundary condition of the form 




6G 


„(2)' 


where 


It = Pt(^DpHt{-,Dut) - DpHt{-,Dut)^ + (^m[DppHt{-, Du't) - rhtDppHt{-, Dut)^ Dz't 

dm'^tdd^ppddfi^'^ Dut^Dzt ptDppHt^'•, Dut^DdYa^t dhtDpppddt(^'^ G)ut)Dzt (E) dDdYn'^ti 

ft = {DpHt{-,Du[) - DpHt{-, Dut), Dziy - {D'^pHti-, Dut), Dzt ® DdmUt) 

fSFt, 6Ft , 6‘^Ft 


dm? 

~ Mi M VM 

9T = -^V^^tAPt) - —y^^TApT) - 


6m 


5m^ 


{■,mT){pT,dmmT), 


and where {Mt)te[o,T] is a square integrable martingale as in (11071) . 
Therefore, 


bt = {p't - Pt) {DpHt{; Du[) - DpHt[;Dut)) 

+ pt(^DpHt{-,Du[) - DpHt{-,Dut) - {Dy^pHt{-,Dut),DdmUt)^ 

+ {^tA>lpHt{-, Du[) - rhtDlpHti-, Dut)j [Dz[ - Dzt) 

+ {m't - rht^ (^DppHt{-, Du^) - DppHt{-,Dut)^Dzt 

+ (m^ -rht- dmmt^Dy,pHt{-,Dut)Dzt 

+ mt{py,pHt{-, Du[) - Dy,pHt{-, Dut) - Dy^Ht{-,Dut)DdmU?jDzt, 


and 


ft 


(DpHt{-,Dut) - DpHt{-,Dut),Dzt - Dzt'^ 

+ (DpHt{-,Du[) - DpHt{-,Dut) - DppHt{-,Dut)DdmUt,Dzt'^ 




6^Ft 

dm^ 


(•, 
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Similarly, 




Applying Theorem 14.31 LemmaPropositions 15.61 and 15.71 and (I140p and (11410 and using the 
same kind of Taylor expansion as in the proof of Proposition 15.61 we deduce that : 


SG 


6^G 


essup^gj^ sup 
te[0,T] 


II II — + —1) T ll/tlln+o; + l|ffTl|n+l+a ^ Cdi^UlQ, TTIq) 


By Proposition 14.18l we complete the proof. 
We thus deduce: 


□ 


Proposition 5.13. For any x e T'^, the function P(T‘^) 3 m i— > 17(0, x, m) is twice differentiable 
in the direction m and the second-order derivatives read, for any m e P(T'^) 

6‘^U 

-^{0,x,m,y,y') = v^°'°\x,m,y,y'), y,y' 

In particular, for any a' e (0, a), t e [0, T] and m e P(T^), the function [6‘^U/6m‘^]{0, -, 171 , •, •) 
belongs to (j"+i+«'(']r'^) x x the mapping 

fijT 

P(T'^) 9 m ^ ^^(0, •, m, •, •) e C"+i+«'(T'^) x x 

om‘^ 

is continuous (with respect to di). The derivatives in y and y' read: 


DiD. 


6^U 


^ dm? 


(0, X, m, y, y') = v^^’'"\x, m, y, y'), y, y' e T'^ 


|A:|, \I\ ^ re — 1. 


Proof. By Proposition 15.121 we indeed know that, for any multi-index i with \I\ ^ re — 1 and 
any x,y e the mapping P(T'^) 3 m ^ Dy\5U / x, rre, y) is differentiable with respect to 

rre, the derivative writing, for any rre 6 P(T'^), 


5m 


Di 


,5U 

'6m 


(0, X, m, y, y') 


= 


(0,x,rre,y,y'), 


y,y 


e 


By Lemma [531 [^/^'m][Dy[6U/6m]]{0,x,m,y,y') is re — 1 times differentiable with respect to 
y' and, together with Proposition 15.101 the derivatives are continuous in all the parameters. 
Making use of Schwarz’ Lemma 12.41 the proof is easily completed. □ 


Following Proposition 15.81 we finally claim: 

Proposition 5.14. Proposition 1 5. 1 7] easily extend to any initial time to £ Then, for any 

of e (0, a), any to e [0, T] and mo e 


lim sup sup 
|fc|^n —1 |£|^n—1 


y y' 5 m? 


(to + h,-,mo,-) - DyD^ 


5‘^U 


y^y' £ 0 - 

^ " orre^ n+l+a',a',a' 
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5.4 Proof of Theorem 12.111 


We now prove Theorem 12.111 Of course the key point is to prove that U, as constructed in the 
previous, subsection is a solution of the master equation (f35]l . 

5.4.1 Regularity Properties of the Solution 

The regularity properties of U follow from Subsections 15. 11 and (5^ see in particular Propo¬ 
sitions [5T3] and [5T4] (pay attention that, in the statements of Theorem 12.111 and of Proposition 
15.131 the indices of regularity in y and y' are not exactly the same). 


5.4.2 Derivation of the Master Equation 

We now have all the necessary ingredients in order to derive the master equation satisfied by 
[7. The first point is to recall that, whenever the forward component {fnt)te[to,T] 1I128P is 
initialized with tuq 6 P(T'^) at time toj then 

U{tQ,x,mo) = ut^ix), xeT*^, 


{ut)te[tQ T] denoting the backward component in (I128p . Moreover, by Lemma [5Tl for any h e 
[T - to], 

uto+h(x) = U(to + h,x + V2(Wt^+h - Wto),mto,to+h), x e T'^, 

where mt^^t the image of fht by the random mapping 3 x x -I- \f2(Wt — Wt^) that is 
xnto,t = [id + -\/2{Wt — WtQ)]‘^rht. In particular, we can write 

U{to + h,x,mo) - U{to,x,mo) 
h 

E[[/(fo + h,x + y/2{WtQ+h - Wto),mto,to+h)] “ U{to,x,mo) 


[/(to + h,x,mo) -¥.[11 {to + h,x + V2{Wto+h - fPio), m-to/o+Zi)] 
^ h 

¥[uto+h{x)] - utoix) 


(145) 


U{to + h, X, mo) - ¥[U{to + h,x + V2{WtQ+h - Wio) 

^ h ■ 

We start with the first term in the right-hand side of (11450 . Following (|83ll . we deduce from the 
backward equation in (jl280 that, for any x 6 T^, 

dt[¥{ut{x))] =E {-Aut + Hto,t{-,Dut) - Fto,t{-,mto,t)}{x) dt, 


where the coefficients Ft^^t and Hto,t are given by (11291) . In particular, thanks to the regularity 
property in Corollary 15.21 we deduce that 

lim ^ -AxU{to,mo,x) + H{x,DJJ{to,mo,x)) - F{x,mo). (146) 

h\o n 

In order to pass to the limit in the last term in (jl45p . we need a specific form of Ito’s formula. 
The precise version is given in Lemma 15.151 below. Applied to the current setting, with 


/3t(-) = DpH{-,D^U{t,-,mto,t)), 
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it says that 


lim —E 

h\o h 


U(to + h,x + V2{Wto+h - Wto),mto^to+h) -U{to + h,x,mo) 
= ^xU{to,x,mo) 

+ 2 


/ dwy[D„iU]{to,x,mo,y)dmo{y) 

Jjd 

DmU (to, X, mo,y)DpH[y, DU{to, y, mo))dmo{y) 

jd 

/ diYx[DmU]{to,x,mo,y)dmo{y) 

Jjd' 


( 147 ) 


[ Tr 

DlirnU{to,x,mo,y,y') 

/[Td]2 



From (I146P and (|147p . we deduce that, for any (x,mo) e x the mapping [0,T] 9 t 

U{t,x,mo) is right-differentiable and, for any to ^ [0,^), 


lim 

h\0 


U (to -f h, x, mo) - U (to, X, mo) 

h 


= -2A^U{to,x,mo) + H{x,D^U{to,x,mo)) - F{x,mo) 

-2 divy[DmU]{to,x,mo,y)dmo{y) 

JTd- 

+ / DmU{to,x,mo,y)DpH{y,DU{to,y,mo))dmo{y) 

JTd- 


- 2 


/ diVx[DmU]{to,x,mo,y)dmo{y) - Tr dI^^U {to, x,mo,y,y) dmo{y)dmo{y). 
Jjd J[T‘*]2 L 


Since the right-hand side is continuous in {to,x,mo), we deduce that U is continuously differen¬ 
tiable in time and satisfies the master equation (1351) . 


5.4.3 Uniqueness 

It now remains to prove uniqueness. Considering a solution V to the master equation (|35l) along 
the lines of Definition 12.101 the strategy is to expand 

= V{t,x + V2Wt,m[), te[0, r], 

where, for a given initial condition mo e V{T'^), m{ is the image of m( by the mapping 9 x i--> 
X -I- V2Wt, (™t)te[o,T] denoting the solution of the Fokker-Planck equation 

dim'-f. = IAm( -I- div{m[DpHt{-, D^V{t, x + \/2Wt, m())) jdt, 

which reads, for almost every realization of {Wt)te[o,T]: the flow of conditional marginal 

distributions (given {Wt)te[o,T]) of the McKean-Vlasov process 

dXt = -DpHt{Xt,D,V{t,x + V2Wt,C{Xt\W)))dt + V2dBt, te[0,T], (148) 

Vo having mg as distribution. Notice that the above equation is uniquely solvable since D^V 
is Lipschitz continuous in the space and measure arguments (by the simple fact that D^V and 
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DmDxV are continuous functions on a compact set). We refer to [G^ for standard solvability 
results for McKean-Vlasov SDKs (which may be easily extended to the current setting). 

Of course, the key point is to prove that the pair -u()ig[o^r] solves the same forward- 
backward system ([78]) as {rht, ut)te[o,T] > ™ which case it will follow that V (0, x, mo) = Uq = uq = 
C/(0,re,mo). (The same argument may be repeated for any other initial condition with another 
initial time.) 

The strategy consists of a suitable application of Lemma [5T5] below. Given 0 ^ t ^ t+h ^ T, 
we have to expand the difference 

E[V{t + h,x + V2Wt+h, | J)] -V{t,x + \f2Wt, m[) 

= E[V(t + h,x + V2Wt+h,m[+f,) |T)] -V{t + h,x + V2Wt, m[) 

(149) 

+ V{t + h,x + V2Wt,m[) -V{t,x + V2Wt,mt) 

= sl,, + sl^. 

By Lemma 15.151 below, with 


it holds that 

SU = h 


Aj;V{t,x + \/2Wt,m[) + 2 [ diVy[DmV]{t,x + V2Wt,m[,y)dmi{y) 
- [ DmV{t,x + V2Wt,m^,y) ■ DpH{y,D^V{t,y,m[))dm't-{y) 


-- (150) 

+ 2 div^[DmV]{t,x + V2Wt,m[,y)dm't{y) 

+ / ^\Dl,rn^]{t,x + V2Wt,m't,y,y')dm).{y)dm[{y') + et,t+h , 

J[Td]2 L J 

where {£s,t)s,te[to,T]-.s!S,t is a family of real-valued random variables such that 

lim sup IE[|es^i|] = 0. 

s,tG[to,T']:|s— 

Expand now 5^^ in (I149D at the first order in t and use the fact that dtV is uniformly continuous 
on the compact set [0,T] x T'^ x T’ 2 (T‘^). Combining (|149p . (|15n|) and the master PDE ([35]) 
satisfied by V, we deduce that 

E[V {t + h,x + V2Wt+h, "lU/i) \^t] -V{t,x + V2Wt,m[) 

- -h\A y{t,x + V2Wt,m[) - H{x + V2Wt, D^V{t,x + V2Wt,m[)) 

/TTytt /\ . 


F{x + V2Wt, m[) + et,t+h 


Considering a partition t = to < ti < ■ ■ ■ < tj\f = T oi [t,T] of step size h, the above identity 
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yields 


E[G{x + V2WT,m'T) - V{t,x + V2Wt,m[)\Tt] 

N-l 

= -hYj \^xy{ti,x + \/ 2 Wt^,m[^) - H[x + V 2 Wt„ D^V{ti,x + V 2 Wt^,m[.)) 
1=0 ^ 

+ F{x + \/2Wt.,m[J 

iV-l 

i =0 


Since 


lim sup 

/i\0 


sup 

r,s£[ 0 ,T]:|r—s|^/i 




^ lim sup sup E[|er^ 5 |] = 0, 

/i \0 r,s£[ 0 ,T]:|)— s\^h 


we can easily replace each E,[st^^ti+h\J^t] by £ti,ti+h hself, allowing for a modification of £ti,ti+h- 
Moreover, here and below (c/. the proof of Lemma 15.151) . we use the fact that, for a random 
process (7t)tg[o,T]) with paths in C'^([0,T],M), satisfying 


essup^gj^ sup | 7 t| < 00 , 

t£[ 0 ,T] 


(151) 


it must hold that 


1 /■* 

lim sup E[| 7 s,t|] = 0, %,* = i- 77 {lr-ls)dr, (152) 

I'S 't\ Js 


the proof just consisting in bounding \rjs^t\ by Wj{h), where stands for the pathwise modulus 
of continuity of (7t)iE[o,T] > which satisfies, thanks to (jl5ip and Lebesgue’s dominated convergence 
theorem. 

Vim E[w^{h)] = 0. 

n\(j 

Therefore, allowing for a modification of the random variables St^^a+h^ for i = 0,..., — 1, we 

deduce that 


E[G(x +V21Tr,mr) - V{t,x + V2Wt,mt)\d^t] 

= - ^ AxV{s,x + V2Ws,m'g) - H[x + V2Ws,DxV{s,x + V2Ws,m'g)) 
+ F(x + V^Ws^m'g) 


ds 


N-l 

h ^ Sti,ti+h¬ 


i =0 


Letting, for any x e T'^, 

M[{x) = V{t,x + \PiWt-, m[) 

AxV (s, x + V2Ws,m'g') — H(^x + v^VLs, D^V (s, x + a/2VLs, m^)) 


f: 


+ F[x + V2Ws,mg) 


ds, 
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we deduce that 


N-l 

E[M^{x) - Mi{x)\Tt] =hj] euM+h. 

i=0 

Now, letting h tend to 0, we deduce that [M[[x))te[o,T] is a martingale. Thanks to the regularity 
properties of V and its derivatives, it is bounded. 

Letting 

vt(x) = V(t,x + V2Wt,mt), te[0,r], 

we finally notice that 

T 

Vt{x) = GT{x,m'rp) + [A^Vsix) - Hs{x, Dvs{x)) + F{x,m'g)]ds - - M/)(x), t e [0,r], 

which proves that Ml;)te[o,T] solves dZH]). 


5.4.4 Tailor-made Ito’s Formula 

Let 17 be a function satisfying the same assumption as in Definition 12.101 and, for a given 
to e [0,T], {'mt)te[tQ,T] be an adapted process with paths in C^{[to,T],V{T'^)) such that, with 
probability 1, for any smooth test function ip 6 C”(T'^), 


/ ip{x)dmt{x) 

Jjd 


Ijd 


[Aip{x) - {(5t{x + \f2{Wt - lTfJ),D(/?(x)>]dmi(x) idt, t e [to,T], 


(153) 


for some adapted process {l3t)te[to,T]j with paths in C'^([to,T'], such that 


essup^gQ sup ll^tllo < 00, 

te[to,T] 

SO that, by Lebesgue’s dominated convergence theorem, 

lim E[ sup WPs - PtWo] = 0. 

s,te[0,T],\t-s\sih 

In other words, {mt)te[to,T] stands for the flow of conditional marginal laws of iXt)te[to,T] given 
Ft, where iXt)te[tQ,T] solves the stochastic differential equation: 

dXt = -Pt{Xt + V2{Wt - Wt,))dt + V2dBt, t 6 [to,T], 

XtQ being distributed according to conditional on Ft- In particular, there exists a deter¬ 
ministic constant C such that, with probability 1, for all to ^ ^ ^ ^ ^ ^ 

di{mt+h,mt) ^ C\fh. 

Given some t e [to;?"], we denote by rui = (• i—> • -I- \f2iyVt — Wtf)))'^rht the push-forward of m-t 
by the application 3 X X + Wt — WtQ e (so that mtQ = mt^). 


We then have the local Ito-Taylor expansion: 
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Lemma 5.15. Under the above assumption, we can find a family of real-valued random variables 
s,te[to,T]:s^t such that 

lim sup = 0, 

s,te[to,T]:|s—i|^/i 

and, for any t e [to,T], 


1 

h 


E[U{t + /i,x + \/2(Wt+h - Wto),nit+h) - U (th, X\/2(Wt+h - Wto),mt)\Tt] 
A,^U{t,x V 2 (Wt-Wto),mt )2 f divy[DmU]{t,x V 2 (Wt - Wto),mt,y)dmt{y) 

Jjd 

- f DmU{t,xV2(Wt-Wto),mt,y) ■ fitiy)dmt{y) 

+ 2 f div,,[DmU]{t,x + V2{Wt -Wto),nit,y)dmt{y) 

{t,x + V2{Wt - Wtfi),mt,y,y')dmt{y)dmt{y') +£t,i+h- 


[ Tr 

U 

/[Td]2 



Proof. Without any loss of generality, we assume that to = 0- Moreover, throughout the analysis, 
we shall use the following variant of (|152p : For two random processes (7t)tg[o,T] (7i)te[o,T]) 

with paths in C°([0, T],C^{E)) and C°([0, T],F) respectively, where E is a compact metric space 
(the distance being denoted by dE) and F is a metric space (the distance being denoted by dp), 
satisfying 


essup^gj^ sup ||7t||o < 00, 

ts[0,T] 


it must hold that 


lim sup Eriryo+n =0, rist= sup sup l7r(7)—7s(a^)l- (154) 


Now, for given t e [0,r) and h e (0,T —t], we let ShWt = Wt+h — Wt and 5hmt = m*. 

By Taylor-Lagrange’s formula, we can find some random variable A with values in [0,1] □ such 
that 


U{t + h,x + V2Wt+h, mt+h) -U{t + h,x + V2Wt,mt) 

f su 

= V2DxU (t + h, X + V2Wt, mt) ■ 5hWt + 


6tr 


(t + /i,x + V2Wt,mt,y)d{Shmt){y) 


+ Dlu{t + h,x + V2Wt + V2\5hWt, mt + XS^mt) • (4^)®" 

C SLf 

+ V2 D^ — {t + h,x + \p2Wt + \/2X5hWt, mt + XSnmt, y) ■ 6hWtd{6hmt) (y) 
Jjd om 


(155) 




6 ^U 

5m^ 


{t + h,x + \/2Wt + V2X5hWt,mt + X5hmt,y,y')d{5hmt){y)d{5hmt){y') 


— J-U 


n~'3 


Tl 


® The fact that A is a random variable may be justified as follows. Given a continuous mapping tp from 
X ViT'^) into R and two random variables {X,m) and {X',m') with values in (R‘^,F(T‘^)) such that the 
mapping [0,1] 3 c v{cX' + (1 — c)X, cm' + (1 — c)m) vanishes at least once, the quantity A = inf{c e [0,1] : 
<p{cX' + (1 — c)X, cm' + (1 — c)m) = 0} defines a random variable since {A > c} = n„gis[\{o} cic'eQe[o,c] {‘p{c'X' + 
(1 — c')X, dm' + (1 — c')m)tfi{X, m) > 1/n}. 
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where we used the dot to denote the inner product in Euclidean spaces. Part of the analysis 
relies on the following decomposition. Given a bounded and Borel measurable function (/j : T'^ —> 
M, it holds that 


/ Hy)d{5hmt){y) 

JT<i 

ip{y)dmt+h{y) - / ip{y)dmt{y) 


[ ip{y + V2Wt+h)dmt+h{y) - [ (fiy + V2Wt)drht{y) (156) 

Ijd J'ld 

(p{y + V2Wt+h) - ipiy + V2Wt) dmt{y) 

(p{y + V2ShWt) - (p{y) dmt{y). 


/ y^{y + V2Wt+h)d{mt+h-'mt){y) + 

/-fd JT'^l 


= / y^{y + \f^Wt+h)d{mt+h-mt){y) + 

J-fd 

In particular, whenever (/? is a bounded Borel measurable mapping from into M, it holds 

that 


[ y^{y,y')d{Shmt){y)d{6hmt){y') 

J[Td]2 

= / ip(y + \/2Wt+h,y')d(mt+h - mt){y)d(6hmt){y') 
J[Td]'2 

+ / ip{y + V26hWt,y') - (p{y,y') dmt{y)d{5hmt){y' 


'[T'i]' 


99 ( 2 / + \f2Wt+h-,y' + \/2Wt+h)d{fht+h - rht){y)d{fht+h - fht){y') 


(157) 


/ 

J\j 

[ 

J[T 

[ 

J\T 


r[Td]2 

f 

I [Td ]2 

f 

r[Td]2 


ip{y + \/2Wt+h, y' + V 24 Wt) -y){y + ^Wt+h, y') d{mt+h - mt) {y)dmt{y') 
^p{y + \/25hWt, y' + V2Wt+h) - y^{y, y' + V2Wt+h) dmt{y)d{mt+h - mt) {y') 
ip{y + V26hWt, y' + ^b^Wt) - y^{y + ^24^*, y') 

- ‘f{y,y' + V25hWt) + (piy,y') dmt{y)dmt{y'). 

We now proceed with the analysis of (|155l) . We start with T^. It is pretty clear that 

l^[Tl\J^t] = 0. (158) 

Look at now the term T^. Following (I156p . write it 


n -<2 _ 

- 


ijd 


6 m 


{t + h,x + V2Wt, mt,y + V2Wt+h)d{rht+h - mt) (y) 


f 

JTd 


su 

— {t + h,x + V2Wt, mt, y + 42414) 
6U 


(159) 


- —{t + h,x + V2Wt,mt,y) dmt{y) 


_ ^ 2,1 . rT-]2,2 

~ -^h ■ 
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By the PDE satisfied by {mt)te[to,T]j we have 

T^2,1 _ 


rt - i-ti n gjj 

’^ = / ds Ay — (t + h,x + V2Wt,mt,y + V2Wt+h)drhs{y) 

Jt 

rt+h f- 

/ ds Dy — (t + h,x + V2Wt,mt,y + V2Wt+h)-l3s{y + V2Ws)dms{y). 

Jt ■/T'i om 


( 160 ) 


Therefore, taking the conditional expectation, dividing by h and using the fact that mt is the 
push-forward of rht by the mapping T'^ 3 x x + \f2Wt (pay attention that the measures below 
are nit and not m^), we can write 

iE[T^’^|J't] = Ay^{t,x + V2Wt,mt,y)dmt{y) 

- / DmU{t, x + V2Wt, mt, y) ■ I3t{y)dmt{y) + et^t+h, 

where, as in the statement, (es,t)o^s^t^r is a generic notation for denoting a family of random 
variables that satisfies 

lim sup E[|es,i|] = 0. (161) 

Here we used the same trick as in (jl54l] to prove (I16ip (see also (I152IB . Indeed, by a first 
application of ()154p . we can write 




/ t+n r gjj 

ds J ^ Ay —[s,x + V2Ws, ms, y + ^/2Ws)dms{y) 

rt+h r 

/ ds Dy — {s,x + V2Ws, ms, y + V2Ws) ■ Ids {y + V2Ws)dms{y) + hst^t+h- 

Jt Jj’t am 


Then, we can apply (|154|] once again with 


5U 


7s (x) = / Ay—(s,x + V2Ws,ms,y + V2Ws)dms(y) 

Jrd dm ^ 

c su 

- / Dy—[s,x + V2Ws,ms,y + V2Ws)-l3s{y + V2Ws)dms{y). 
Jjd dm 

Using Ito’s formula to handle the second term in (I159D . we get in a similar way 
1 f SU 

-E[r^|J'i] = 2 / Ay—(t,x + V2Wt,mt,y)dmtiy) 
ti Jfd dm 

- / DmU(t, X + V2Wt, mt, y) ■ I3tiy)dmt{y) + St,t+h- 
Turn now to in (|155p . Using again (11541) . it is quite clear that 


(162) 


^E[r^|7)] = Ar,U{t,x + V2Wt,mt) + et,t+h- 


(163) 
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We now handle T^. Following (|156p . we write 


Tt 


V2 [ 


6U 

+ h,x + V2Wt + y/2X6hWt, rnt + XShfnt, y + \/2Wt+h) 


ShWt 


d{'mt+h - 'mt){y) 


C r SU 

V 2 J ^ Dx — {t + h,x + V2Wt + V2X6hWt, mt + X6hmt, y + V2ShWt) 


6U 


- Dx—(t + h,x + V2Wt + V2XdhWt, mt + X5hmt, y) ■ 6hWtdmt{y) 


_ 0^4,1 . rT-i4,2 

~ -^h • 


Making use of the forward Fokker-Planck equation for {mt)ts[tQ,T] S'® in the proof of (I162p . we 
get that 

Now, by Taylor-Lagrange’s formula, we can find another [0, l]-valued random variable A' 
such that 


Tt" = 2 


err 

^ DyDx — {t + h,x + V2Wt + V2X5hWt, mt + X6hmt, y + V2X'6hWt) 

L uTTl 


{5hWt)’^^]dmt{y). 


And, then. 


1 


1 


-E[T^\Tt] = -E[T;:’^\Tt]+et,t+h 


= 2 


r su^ 

J ^ diYy[Da;—] {t,x + V2Wt,mt,y)dmt{y) + et,t+h (164) 


6U- 


2 / div^[Dy—](t,x + V2Wt,mt,y)dmt{y)+£t,t+h- 

Jjd dm 
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It finally remains to handle T^. Thanks to (|157p . we write 




\L 


\L 


5^U 

^jd-^2 Sw?' 

5^U 


[Td]2 _ 

6^U 


{t + h,x + V2Wt + V2X6hWt, mt + XShmt, y + \f2Wt+h-, y' + \f2Wt+h) 

d{mt+h - mt){y)d{mt+h - rht){y') 

^^2 a; + V^Wt + ^\6hWt, mt + XShmt, y + V26hWt, y' + V2Wt+h) 


5m^ 


{t + h,x + V2Wt + \f2X6hWt, mt + X6hmt, y, y' + \f2Wt+h) 


dmt{y) 

d{mt+h - mt){y') 


5^U 


( 5^2 {t + h^x + V2Wt + V2X6hWt, mt + X6hmt, y + V2Wt+h, y' + \f25hWt) 
a: + V2Wt + ^X5hWt, mt + X6hmt, y + V2Wt+h, y') 


r[Td]2 _ 

5‘^U 


IL 


[Td]2 _ 

6^U 


dm^ 

6^U 

dm^ 

d^U 

dm^ 


d{mt+h - mt){y) 
dmt{y') 

( 5^2 + + V2Wt + ^XdhWt, mt + Xdhmt, y + V2dhWt, y' + ^dhWt) 

{t + h,x + V2Wt + ^XdhWt, mt + Xdhmt, y + V2dhWt, y) 


d^U 


(t + h,x + V2XdhWt, mt + Xdtmt, y, y' + ^dhWt) 


{t + h,x + V2Wt + ^XdhWt, mt + Xdhmt, y, y') 


dmt{y)dmt{y') 


+ + + r«). 


(165) 


Making use of the Fokker-Planck equation satisfied by {mt)te[tQ,T] together with the regularity 
assumptions of d’^Ujdm? in Definition 12.101 it is readily seen that 


E[r« + rf+ 


5 4 

Focus now on r^’ . With obvious notation, write it under the form 

^ 5,4 _ ^ 5 , 4,1 _ ^ 5 , 4,2 _ ^ 5 , 4,3 ^ 5 , 4,4 


(166) 


(167) 
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Performing a second-order Taylor expansion, we get 


T'5,4,1 


f + h,x + V2Wt + V2\6hWt,mt + X6hmt,y,y')dmt{y)dmt{y') 

j[Td ]2 dm 


+ f V2Dy^-^[t + h,x + V2Wt + V2X6hWt,mt + X6hmt,y,y') ■ 5hWtdmt{y)dmt{y') 
J\jd-^2 dm 


6^U 


+ I \f2Dyij^{t + h,x + V2Wt + V2X6hWt,mt + X6hmt,y,y') • 5hWtdmt{y)dmt{y') 


'[T4]2 


.5‘^U 


+ / + h,x + V2Wt + V2X6hWt, mt + Xdtmt, y, y) ■ {5hWt)®‘^dmt{y)dmt{y') 


f[Td]2 5m^ 

6^U 
5m^ 


+ h,x + \f2Wt + V2XShWt,mt + X6hmt,y,y') • {dhWt)^ dmt{y)dmt{y') 

f 2DyDyf^-^{t + h,x + V2Wt + V2XShWt,mt + X5hmt,y,y') ■ {dhWt)®"^dmt{y)dmt{y') 
Jndvi dm 


'[T''] 

+ £t,t+h 


- Th’ ’ + + Ih + Jh + Jf^ + J^’ -I- het^t+h- 


Similarly, we get 


7-'5,4,2 _ ^5,4,4 i rl i t 1 i 

J^h - +-^h + dh + det,t+h, 

r7-t5,4,3 r7-i5,4,4 I t2 I t2 I u 

Th ~ Th + dh + Th + hSht+h, 


from which, together with (|167|1 . we deduce that 

n5A _ jl,2 


Tr = Jh' +het,t+h, 


and then, with ()166p . 


-PEpf-IJ-J+ew+i. 


f Tv\DyDy,^-^(t,x + V2Wt,mt,y,y)]dmt{y)dmt{y') + St,t+h- 
J\fdr2 L dm J 


r[Tti]2 

From (I155p . (jl58p . (I162h . (I163p . (I164p and pi69p . we deduce that, 
1 


h 

= A 


'[T4]2 

which completes the proof. 


E [[/{t + h,x + y/2Wt+h, mt) — U{t + h,x + \f2Wt^ mt) \Tt\ 

U{t,x + \/2Wt,mt)+ 2 I dwy[DraU]{t,x + V2Wt,mt,y)dmt{y) 

/ DmU{t, X + V2Wt, mt, y) ■ I3t{y)dmt{y) 

JT<i 

2 div,^[DmU]{t,x + V2Wt,mt,y)dmt{y) 

JTd 

/ Tv D‘^^U(t,x + V2Wt,mt,y,y') dmt{y)dmt{y') + St^t+h, 

J\fdl2 L J 


(168) 


(169) 


□ 
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5.5 Proof of Corollary 12.1^ 

We are now ready to come back to the well-posedness of the stochastic MFG system 

( dtUt = {—2Aut + H{x,Dut) — F{x,mt) — y/2div{vt)}dt + vt ■ V2dWt, 

•I dtrrit = \2Amt + d\\{rntDpH{x,Dut))\dt — '\/2div(mtdTW), in \tQ,T^ x T*^, (170) 

[ mtQ = mo, ut{x) = G{x,mT) in T'^. 

For simplicity of notation, we prove the existence and uniqueness of the solution for to = 0. 

First step. Existence of a solution. We start with the solution Mt)te[o,T] to the 

system 

f dtfht = {Amt + di\{mtDpHt{-,Dut))]dt, 

< dtUt = {-Aut +Ht{-,Dut) - Ft{-,mt)]dt + dMt, ( 171 ) 

[ mo = fno-, ut{x) = G{x,mT) in 

where Ht{x,p) = H{x + ^/2Wt,p), Ft{x,m) = F{x + V2Wt,m) and G{x,m) = G{x + \/2 Wt, m). 
The existence and uniqueness of a solution Mt)te[Q,T] to (jl71l) is ensured by Theorem 

Ol Given such a solution, we let 

ut{x) = ut{x - \/2Wt), X e ; mt = [id + \/2Wt)'^mt, t6[0,r]. 


and claim that the pair (rtt, mt)tg[o,T] thus defined satisfies (jl70p (for a suitable {vt)te[o,T])- 

The dynamics satisfied by {mt)te[o,T] given by the so-called Ito-Wentzell formula for 
distributed-valued processes, see [431 Theorem 1.1], the proof of which works as follows: for any 
test function (f £ C^(T‘^) and any z e we have f^d 4’{x)dmt{x) = fjd4>{x + V2Wt)dmt{x); 
expanding the variation of (f,jj,d 4>{x + z)drht{x))i^^o,T] by means of the Fokker-Planck equation 
satisfied by {mt)te[Q,T] and then replacing z by \f2Wt, we then obtain the semi-martingale 
expansion of (fjd 4>{x + V2Wt)dmt{x))te[o,T] by applying the standard Ito-Wentzell formula. 
Once again we refer to [431 Theorem 1.1] for a complete account. 

Applying [43 [ Theorem 1.1] to our framework (with the formal writing (mt(x) = rht{x — 
V2Wt))te[o,T])^ this shows exactly that {mt)te[o,T] solves 


dtmt = |2Amt -I- div(^DpHt[x — V2Wt, Dut{x — \/2Wt))^ jdt — \/2di\{mtdWt) 
= ^2Amt + div(^DpH[x, Dut{x))^ ^dt — V2div{mtdWt). 


(172) 


Next we consider the equation satisfied by iut)te[o,T]- Generally speaking, the strategy is 
similar. Intuitively, it consists in applying Ito-Wenztell formula again, but to {ut{x) = ut{x — 
V2Wt))te[o,T]- Anyhow, in order to apply Ito-Wentzell formula, we need first to identify the 
martingale part in (fit(x))jg[o,T] (namely iMtix))te[o,T])- Recalling from Lemma [5T] the formula 


ut{x) = U(t,x + V2Wt,mt), t6[0,r]. 


we understand that the martingale part of (Ft(x))ig[o,T] should be given by the first-order expan¬ 
sion of the above right-hand side (using an appropriate version of Ito’s formula for functionals 
defined on [0,T] x T'^ x V{T'^)). For our purpose, it is simpler to express ut{x) in terms of U 
directly: 

ut{x) = U{t,x,mt), t6[0,r]. 

The trick is then to expand the above right-hand side by taking benefit from the master equation 
satisfied by U and from the tailor-made Ito’s formula given in Lemma 15.151 
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In order to apply Lemma 15.151 we observe that, in {U{t,x,mt))te[o,T]-, the x-dynamics are 
entirely frozen so that we are led back to the case when U is independent of x. With the same 
notation as in Lemma 15.151 we then get 

E[U{t + h,x,mt+h) -U{t + h,x,mt)\J='t] 


= 2 


/ dwy[DmU]{t,x,mt,y)dmt{y) 
Jjd 


(173) 


- / DmU{t,x,mt,y) ■ DpH{y,Dut{y))dmt{y) 

Jjd 

+ / T^^[DmmU]it,x,mt,y,y')dmt{y)dmt{y') + £t,t+h- 
i[Trf]2 

Of course, this gives the absolutely continuous part only in the semi-martingale expansion of 
{U{t, X, mt))te[o,T] ■ III order to compute the martingale part, one must revisit the proof of Lemma 
15.151 Going back to (I155h . we know that, in our case, T^, and are zero (as everything 
works as if U was independent of x). 

Now, denoting by {rjs,t)s,te[o,T\.s!it ^ family of random variables satisfying 


lim i sup E[|7/,,t|2] = 0, 

/i\0 tl s,te[0,T]:\s-t\sih 


(174) 


we can write, by (I159p and (IlGOp : 

^ Dy^{t,x,mt,y)dmt{y)^ ■ 6hWt -f yt, 


Moreover, by (I165p and (|168p 
proving that 


dt^t+hi 


U{t + h,x,mt+h) - E[U{t -t h,x,mt+h)\J^t] 

= V 2 (^J^ Dy^{t,x,mt,y)dmtiy)^ ■ dhWt + r]t,t+h, 

for some family ir]s,t)s,te[o,T]:siit that must satisfy pi74p . With such a decomposition, it holds that 
^['nt,t+h 1^7] = 0. Therefore, for any t e [0, T] and any partition 0 = ro < ri < r 2 < • • • < tat = t, 
we have 

7V-1 


■ 1 ^ 

~ f f SJ_J \ 

Ti y^yJ^^Dy — {ri,x,mri,y)drnri{y)j • - hLrJ + r]ri,ri 


i=0 
7V-1 


i=0 L 

with the property that 

I ] ^ 71ri,ri+i l^i+l ~ 

where lim/i\^osup(s^ 4 )g|-o,T] 2 :|s-t|^/i = 0. By a standard computation of conditional expecta¬ 

tion, we have that 

























where <5 stands for the mesh of the partition tq, ri,..., tat. As a consequence, the following limit 
holds true in L^: 


N-l 

lim V () =V2 DmU{s,x,ms,y) ■ dWg. 
-5X0 V ^ Jo 


Together with ()173l) . we deduce that 


dtut{x) = -I dtU{t,x,mt) 

/ 2dwy[D„iU]{t,x,mt,y) - DmU{t,x,mt,y) ■ DpHt{y,Dut{y)) dmt{y) 


Ijd jjd 


^[DmmU]it, X, mt, y, y’)dmt{y)dmt{y’) \dt 


Letting 


V2( I ^DmU{t,x,mt,y)dmt{y)] ■ dWt. 


vt{x) = / DmU{t,x,mt,y)dmt{y), t6[0,T], x e T'^, 


and using the master equation satisfied by U, we obtain therefore 

dtUt{x) = 2Ant(x) + H (x, Dut{x)) — F{x, mt) — V^div(vt{x)) | dt + Vt{x) ■ \f2dWt- 


Together with (11721) . this completes the proof of the existence of a solution to (I170p . 

Second step. Uniqueness of the solution. We now prove uniqueness of the solution to (11701) . 
Given a solution {ut,mt)te[o,T] (with some (ni)tg[o,r]) to (|170p . we let 


ut{x) = ut{x + V2Wt), xeT'^, rht = {id - V2Wt)‘^mt, te[0,r]. 


In order to prove uniqueness, it suffices to show that {ut,rht)te[o,T] is a solution to (jl71l) (for 
some martingale (Afi)ts[o,T])- 

We first investigate the dynamics of {mt)te[o,T]- As in the first step {existence of a solution), 
we may apply Ito-Wenztell formula for distribution-valued processes. Indeed, thanks to 
Theorem 1.1] (with the formal writing {mt{x) = mt{x + ^/2Wt))te[o,T])■, we get exactly that 
{mt)te[o,T] satisfy the first equation in (11711) . 

In order to prove the second equation in (jl71l) , we apply Ito-Wentzell formula for real-valued 
processes to {ut{x) = ut{x + '\/2ITt))tE[o,T]i see [33l Theorem 3.1]. 


116 




















6 Convergence of the Nash system 


In this section, we consider, for an integer V ^ 2, a classical solution of the Nash 

system with a common noise: 

-dtv^’\t,x) - j3j^TTDl.^^^v^’\t,x) + H{xi,Da;^v^’\t,x)) 

j 

+ J] DpH{xj,D^.v^’^{t,x)) • D^y’\t,x) = F{xi,m^’^) in [0,r] x (T'=')^, 
v^’^T, x) = G(xi, m^’*) in 

^ (175) 

where we set, for x = (xi,..., xn) e = — — - ^ 6xy Our aim is to prove Theorem 

ji=i 


12.131 which says that the solution (u^’*)jg{i converges, in a suitable sense, to the solution of 
the second order master equation and Theorem l2.151 which claims that the optimal trajectories 
also converge. 

Throughout this part we assume that H, F and G satisfy the assumption of Theorem 12.111 
with n ^ 2. This allows us to define U = U{t,x,m) the solution of the second order master 
equation 


^ -dtU - (1 + /3)AxU + Hix, DxU) - (1 + /?) [ divj, [D^U] dm{y) 

+ [ DmU ■ DpH{y,DxU) dm{y) 

Jjd- 

-2/3 [ diVa; [DmU] dm{y) - /3 f TrDl^^U dm{y)dm{z) = F{x,m) 
JT'i J[T‘^]2 

in (0,r) X T"* X 

U(T,x,m) = G{x,m) in x P(T'^), 


(176) 


where /3 ^ 0 is a parameter for the common noise. For a' e (0,a), we have for any {t,x) e 
[0,r] X T'^, m,m' 6 V{T<^) 





5‘^U . 

+ 


+ 

(n+2+Q:^,n+l+a;^) 



{n+2+a',n+a',n+a') (177) 


^ Co, 


and that the mapping 


[0, T] X P(T'^) 3 (t, m) 


6^U 

5m? 


it,' 




•,•) eC 


71+2+0;' /nr^ 


(T'^) X 


C 


n+o' ffjrd 


(pjpd 


(178) 


is continuous. As already said, a solution of (I176p satisfying the above properties has been built 
in Theorem 12.111 When /3 = 0, one just needs to replace the above assumptions by those of 
Theorem 12.81 which does not require the second order differentiability of F and G with respect 
to m. 

The main idea for proving the convergence of the (u'^’*)jg{i,,, ^v} towards the solution U is 
to use the fact that suitable finite dimensional projections of U are nearly solutions to the Nash 
equilibrium equation. Actually, as we already alluded to at the end of Section [2l this strategy 
works under weaker assumptions than that required in the statement of Theorem 12.111 What is 
really needed is that H and DpH are globally Lipschitz continuous and that the master equation 
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has a classical solution satisfying the conclusion of Theorem 12.111 (or Theorem 12.81 if /3 = 0). In 
particular, the monotonicity properties of F and G have no role in the proof of the convergence 
of the A^-Nash system. We refer to Remarks 16.51 and 16.71 below and we let the interesting reader 
reformulate the statements of Theorems 12.131 and 12.151 accordingly. 


6.1 Finite dimensional projections of U 

For N ^ 2 and z e {1,... ,N} we set 

where® = {xi,...,xn) e ^ ^ ^ ^ 4, 

j^i 


Note that the are at least with respect to the Xi variable because so is U. Moreover, 
dtu^’^ exists and is continuous because of the regularity of U. The next statement says that 
is actually globally in the space variables: 


Proposition 6.1. For any N ^ 2, i e {1,... ,N}, is of class in the space variables, with 


Dr,.u^’\t,x) = ^ DmU{t,Xi,m^’\xj) 

Dl^,^.u^’\t,x) = j^^^D,,DmU{t,Xi,m^’\xj) 


[DmU] {t,Xi,m^’\xj) 


1 


N,i 


{N - 1)2 




while, if j k, Dl. ,^^u^’\t,x) = -jj^^^Dl^JJ{t,Xi,m^’\xj,Xk) 


U ^ i), 

{j ^ ^)> 

U ^ *) 

{i,j,k distinct). 


Remark 6.2. If we only assume that U has a hrst order derivative with respect to m, one can 
show that, for any ^ 2, z e {1,..., N}, is of class in all the variables, with 


D,, u^’\t,x) = 


1 


N - 1 


DmU{t,Xi,m^’\xj) 


\fj ^ i, 


with a globally Lipschitz continuous space derivative. The proof is the same except that one 
uses Proposition 17.31 instead of Proposition 17.51 

Proof. For x = {xj)je{i^,,,^N} such that Xj ¥= Xk for any j ^ k, let e = minj^^ \xj — Xk\. For 
V = (vj) e (M'^)^ with Uj = 0 (the value of z e {1,..., N} being fixed), we consider a smooth 
vector field (j) such that 

(p{x) = Vj if X 6 B{xj, e/4), 

where B{xj,e/4:) is the ball of center xj and of radius e/4. Then, in view of our assumptions 
(I177P and ()178p on U, Propositions 17.51 and 17.61 in Appendix imply that 


U {t, Xi, {id + (/>)tlm^’*) - U {t, Xi,m^’^) - [ D^U {t, Xi,m^’\y) • (/(y) dm^’^y) 

JTd 

Dy[D^U]{t,Xi,m^’\y)4>{y) ■ (j){y) dm^’^y) 


1 

2 J'^d 

hi f 

^ JTd- Jt^ 


DjamU{t,Xi,m^'\y,y)(t){y) ■ (j){y') dm^’\y)dm^'\y') 




Ilia 
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for some modulus uj such that a;(s) —> 0 as s ^ 0~*". Therefore, 


X + v) — x) 

= DmU (t, Xi, m^’\y) ■ (j){y) dm^’\y) + ^ 


Td 


Dy[DraU] {t,Xi,m^’\y)(l){y) ■ (j){y)dm^’\y) 


kf f 

^ JT<i JT'i 




N -1^. 


^ DmU (t, Xi, m^'\ Xj) • Vj + 2 Uy[DmU] {t, Xi,m^’\xj)vj ■ vj 

2 ^limU{t,Xi,m^'\Xj,Xk)vj ■ Vk + \v\‘^Uj{\v\). 


j,k^i 


This shows that has a second order expansion at x with respect to the variables [xj)j^i 


and that 




D,^.u^’\t,x) - ^ _^DmU{t,Xi,m^’\xj) 

{j ^ 


^ ^Dy[DmU]{t,Xi,m^’\xj) 



T (jV -^yi^mmU Xi,m^ ,Xj,Xj^ 

{j ^ i) 

while, if J k. 


{i,j,k distinct) 


So far we have proved the existence of first and second order space derivatives of U in the open 
subset of [0,T] x consisting in the points (t, x) = (t, xi, • • • xn) such that Xi ^ Xj for any 

i ^ j. As DmU, Dy YDmU^ and D'^^U are continuous, these first and second order derivatives 
can be continuously extended to the whole space [0,T] x and therefore is with 

respect to the space variables in [0, T] x T^'^. □ 

We now show that is “almost” a solution to the Nash system ()175p : 

Proposition 6.3. One has, for any i e N}, 


^ - /3 2 + Hix„ 

j j,k 

+ ■ DpH{xj,D,^jU^’^{t,x)) = F{xi,m^’'-) + r^’\t,x) 

3¥=i 

in (0,r) X 

^ u'^’*(T, a;) = G(xj, m^’*) in 


(179) 


where r^’"^ e C°([0,T] x T*^) with 



< 


C 

N' 


Remark 6.4. When /3 = 0, we can require U to have only a first order derivative with respect 
to the measure, but in this case equation (I179p only holds a.e. with e L® still satisfying 


„Nj|i 


< 


c 

N’ 
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Proof. As U solves (jl76p . one has at a point 

- dtU - {1 +/3)AxU + H{xi,DxU) - {1 + f3) [ divy[DmU](t,Xi,mx'\y)dmx'\y) 

Jrd- 

+ [ DmU{t,Xi,m^’\y) ■ DpH{y,DxU{t,y,m^’'))dm^'\y) 

- 2/3 [ diY^[DmU]{t,Xi,m^’\y)dm^’\y) 

JTd 

-p [ TrD‘^^U{t,Xi,m^’\y,z)dm^’\y)dm^'Pz) =F{xi,m^’^). 

So satisfies: 

- dtu^’^ - (1 + /3)A^,n^’* + H{xi,D^.u^’P - (1 + /3) [ divy[DmU]{t,Xi,m^’\y)dm^’\y) 
+ 1 YiDmU{t,Xi,m^’\xj) ■ DpH{xj,D^U{t,Xj,m^’P) 

jiti 

- 2 P f diVx[DmU]{t,Xi,m^’\y)dm^’\y) 

JTd- 


-p [ TrD‘^^U{t,Xi,m^'\y,z)dm^’Py)dm^’\z) = F{xi,m^’P. 
JT'i 


Note that, by Proposition 16.1 




In particular, 


W^.U^’iao ^ i-r- 

II 11^ ^ J\T 


c 


By the Lipschitz continuity of D^U with respect to m, we have 




C 


iV- 1’ 


so that, by Lipschitz continuity of DpH, 


\DpH{xy,D^U{t,xj,m^’P) - DpH{xj,D^.u^’^{px))\ ^ 


Collecting the above relations, we obtain 
1 


N - 


- ^ DmU{t,Xi,m^’\xj) ■ DpH{xj,D^U{t,Xj,m^’P) 


j^i 


^ D^.u^’Pt,x) ■ DpH{xj,D^U{t,Xj,m^^'-)) 


j^i 


Y^ Dx^’^px) ■ DpH{xj,D^y’^{t,x)) +0{1/N), 

j¥=i 


(180) 


(181) 
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where we used (11801) in the last inequality. On the other hand, 


N 


N 


i=i 


/? 2 = (1 + + (1 + /3) ^ 


j,A;=l 


j^i 


.U 


j^i 

f;S 2 ™ 


2 N,i 
Xj,Xk“‘ ’ 


where, using Proposition 16.11 

j¥=i 


jiti 

Therefore 


[ divy[DmU]{t,Xi,m^’\y)dm^’\y) 

JT'i 

+ U]{t,Xi, ,y,y) dm^’^(y) 

[ dwa:[DmU]{t,Xi,m^’\y)dm^’\y) 

[ [ T^^[Di,rnU]{t,Xi,m^'\y,z)drn^’\y)dm^'\z). 

Jjd Jjd 


dtu^’\t,x) - J^A^^u^’\t,x) - + H{xi,Da;,u^’\t,x)) 


j,k 


+ 2 Dx.u^^\t,x) ■ DpH{xj,DxjU^’^{t,x)) 


j^i 


+ lT^^D‘imU{t,Xi,m^’\y,y)dm^’\y) = F{xi,m^’^) + 0{l/N), 


I'Id 

which shows the result. 


□ 


Remark 6.5. The reader may observe that, in addition to the existence of a classical solution 
U (to the master equation) satisfying the conclusion of Theorem \2.11l only the global Lipsehitz 
property of DpH is used in the proof, see (I18ip . 


6.2 Convergence 

We now turn to the proof of Theorem 12.181 For this, we consider the solution (u'^’*)jg{i of 
the Nash system (I175p . By uniqueness of the solution, the must be symmetrical. 

By symmetrical, we mean that, for any x = ^ ^md for any indices j k, if 

X = {xi)ie{i^...^N} is the A-tuple obtained from x by permuting the j and k vectors (i.e., xi = xi 
for I f {j, k}, Xj = Xk, Xk = Xj), then 

x) = v^’^{t, x) if i f {j, k}, while x) = v^’’^{t, x) if i = j, 

which may be reformulated as follows: There exists a function : T'^ x —> M such 

that, for any x e T'^, the function ^ ^ yN-i) V^{x, {yi,. .., y7v-i)) is invariant 

under permutation, and 

\fie ,N}, xe v^’\t,x) = {xi,{xi,... ,Xi-i,Xi+i,... ,xn)) ■ 
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Note that the (u'^’*)iE{i,...,iv} are also symmetrical. 

The proof of Theorem 12.131 consists in comparing “optimal trajectories” for and for 
for any i 6 {1,..., N}. For this, let us fix to ^ ^ 'P{T'^) and let 

be an i.i.d family of N random variables of law mo. We set Z = (.^j)jE{i n}- Let also 

m) t£[o,T])is{i,...,Af} be a family of N independent d-dimensional Brownian Motions which is 
also independent of (•^i)jG{i,...,Af} let VF be a d-dimensional Brownian Motion independent 
of the ((B|)ig[o,T])i£{i,...,iV} and (-^i)iE{i,...,Af}- We consider the systems of SDEs with variables 
{Xt = (^i,t)iE{i,...,Ar})t£[o,r] and {Yt = (Ti,t)ig{i^..._Ar})te[o,r](the SDEs being set on with 
periodic coefficients): 

dXi^t = -DpH{Xi,t,D,,u^’\t,Xt))dt + V2dBi + ^dWt 

^i,tQ ~ ^i-) 

and 

dY,,t = -DpH{Yi^t,D,,v^’^{t,Yt))dt + V2dB| + ^dWt 
= Zi. 

Note that, since the {u^'^)ie{i,...,N} are symmetrical, the processes ((^i,t)tE[to,T])i£{i,...,Af} are 
exchangeable. The same holds for the {{Yi^t)te[to,T])ie{i,...,N} and, actually, the N M^'^-valued 
processes iiXi^t,Yi^t)te[to,T])ie{i,...,N} are also exchangeable. 


t e [to,T] 


(182) 




11831 


Theorem 6.6. Under the standing assumptions, we have, for any i e {!,...,A^}, 
E[ sup_jTi,* - W,t|] ^ 


te[to,T] 


E 


sup \u^’\t,Yt) - v^’\t,Yt)\ 

.te[to,T] 

+ r\D^^v^\t, Yt) - Yt)\^dt 

ho 


^ CN 


-2 


and, P almost surely. 


N 


N 


Y,\v^'\to,Z)-u^^\to,Z)\^CN-\ 


i=l 


(184) 


(185) 


(186) 


where C is a (deterministic) constant that does not depend on to, mo and N. 

Proof of Theorem 1 6 . 61 First step. We start with the proof of (|185l) . For simplicity, we work 
with to = 0. Let us first introduce new notations: 

C/f’* = n^’*(Ll^t), V,^'^ = v^’\t,Yt), 

Yt), Yt), te [0, T]. 


Using equation ()175p satisfied by the (u^’*)ig{i^. we deduce from Ito’s formula that, for any 
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ze 




dtv^’\t, Yt) - 2 Yt) • DpH{Yj,t, Yt)) 


2 Yt)+f3j] Yt) 


j,k 


dt 


+ x /2 2 Yt)dBi + Yt)dWt 

j 3 

[H{Yi^t, 1-t)) - Yt) • DpH{Yi,t, 1^*)) 


( 187 ) 


-F{Yt^t,m^i) 


dt 


+ V2Y,D,jv'^’\t,Yt)dB{ + 

3 3 

Similarly, as (^i'^’*)iG{i,,,,,Ar} satisfies (11791) . we have by standard computation 


dU, 


N,i 


H{Yi,t, Yt)) - Yt) • DpH{Y,^t, Ft)) 


-F{Y,,t,m^'i)-r^\t,Yt) 


dt 


(188) 


-Y,Dx,u^’\t,Yt) • (^DpH{Y,^t,D^yd^t,Yt)) - DpH{Y,^t,D,.u^’^it,Yt)))dt 
3 

+ V2Y,D^.u^^\t, Yt) • dBi + 

3 3 

Make the difference between (jl87p and pi88p . take the square and apply Ito’s formula again: 

2(C/f’' - Vt^'^) ■ (H[Yi^t,DU^'^'^) - H[Yi^t,DVt^'^'^)) 

- 2([/f’* - • (^DUt^’^’^ ■ [DpH{Yi^t,DUt^'^'") - DpH{Yt^t, 

- 2([/f’* - Vt^’^) ■ ([DC/f’*’* - DVt^’^’^] • DpH{Yi^t,DVt^’^’^ 


- 2([/f’* - Vt^'")r^'\t,Yt) 


dt 


2([/f.* _ • [DpH{Y^^t,DVt^^^d^ - DpH{Y,,t,DU^’^'^)) 


2^ |ZlC/f" - DVt^'^d\2 ^ 2/3|2(-Dt/f’*’^' - HV)^’*’^') 


dt 


dt 


V2 2(Z)t/f• dBi + ■ dWt 


Recall now that H and DpH are Lipschitz continuous in the variable p. Recall also that 

N^i\ •_ 1 _ T 1 , 1 __ 1 _• 7 V 7 - _ _1 j_ _ _1 j_l_ _ j_ T^TT^Ad _ 


= DxiU{t,Yi^t,nT'Yi) bounded, independently of i, N and t, and that DUi 
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DxjU{t,Yi^t,'>TT'Y^) is bounded by C/N when i =|= j, for C independent of i, j, N and t. Re¬ 
call finally from Proposition 16.31 that is bounded by C/N. Integrating from t to T in 
the above formula and taking the conditional expectation given Z (with the shorten notation 
K^[-] = E[-|Z]), we deduce: 



(189) 


Note that the boundary condition — V^’* is zero. By a standard convexity argument, we 
get 


+E" 


i: 




' [I I/f- vf’^ 1 2] ds + i 2 E^ / I fds 
^ j 

By Gronwall’s Lemma, we finally get (modifying the value of the constant C): 

[f 


sup E'^[|C/f’* - Pi^’T] + E-^ 
tG[o,r] 

rT 

X ^—> r7 / 

< 




’dd 12 


(190) 


Taking the expectation and using the exchangeability of the processes 
we obtain (I185D . 




Second step. We now derive (|184p and (|186p . We start with (|186p . Noticing that Uq'^ — 
Pq^’* = M^’*(0, Z) — -u'^’*(0, Z), we deduce, by summing pOUp over i 6 {1,..., N}, that, with 
probability 1 under P, 


1 ^ 

-y 

N ^ 


u 


N,i 


{0,Z) -V^’\0,Z 




i=l 


c 

iv’ 


which is exactly p86l) . 

We are now ready to estimate the difference W,t — Yi,t, for t e [0, T] and i e {!,..., Nj. In 
view of the equation satisfied by the processes (-Pi,t)iE[o,T] by (Pi,t)te[o,r]) have 


i^i,t - yi,ti ^ 


f)Dpff(X,^,, DxM^’^(s, XP) - Dpff(Yi^,, P ,))\ds 

Jo 




•I 


C / |Xi,, - Yi,s\ds + C 


f 


’fo - T»P 




ds. 


By Gronwall inequality and by (I19np . we obtain p84h . 


(191) 

□ 
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Remark 6.7. The reader may observe that, in addition to the existence of a classical solution 
U (to the master equation) satisfying the conclusion of Theorem \2.11l only the global Lipschitz 
properties of H and DpH are used in the proof, see (I189p and (I19ip . 

Proof of Theorem \2. 131 For part (i), let us choose mo = 1 and apply (11861) : 


1 

N 



(to, Z) 


^ CN 


-1 


a.e., 


where Z = (Zi,..., Zjy) with Z\,..., Zjq i.i.d. random variables with uniform density on 
The support of Z being we derive from the continuity of U and of the 

that the above inequality holds for any x e 


1 ^ 

^ Xj ^ ^ V®e(T 


rd\N 


i=l 


Then we use the Lipschitz continuity of U with respect to m to replace U(to,Xi,mx'^) by 
U{to,Xi,m^) in the above inequality, the additional error term being of order 1/N. 

For proving (ii), we use the the Lipschitz continuity of U and a result by Dereich, Scheutzow 
and Schottstedt [23] to deduce that, for d ^ 3 and for any Xj e T'^, 


/ \u^’\t,x) - U{t,Xi,mQ)\ T~[ mo(dxj) 

= / \U{t,Xi,m)^’")-U{t,Xi,mo)\]lmoidxj) 

jJ-- 

f di(m^’*, mo) rr mo(dxj) ^ 

JTd(N-l) 

If d = 2, following Ajtai, Komlos and Tusnady [6|, the right-hand side has to be replaced by 
log(N). Combining Theorem 16.61 with the above inequality, we obtain therefore, for d ^ 3, 




/ / v^^"{t,{xj))Y]mQ{dxj) -U{t,Xi,mQ) 

JTd(N-l) 


dmo(xi) 


N 


^E[\v^’\t,Z)-u^'''{t,Z)\] + / - [/(t,Xi,mo)| r~[ mo(dxj) 

JjdN 

^CN-^ + CN-^I^ ^ CN-^/^. 


As above, the right-hand side is N log(A^) if d 


2. This shows part (ii) of the theorem. □ 


Proof of Corollary \2.14\ We fix (t,xi,m) e [0,r] x x P(T'^) and assume that there exists 
u 6 M such that 


lim sup 


N,1 


N—*-\-cc, m / 




(t, x’) — u| = 0. 


Our aim is to show that, if xi belongs to the support of m, then v = U(t,xi,m). For this we 
first note, from a standard application of the maximum principle, that the ('y'^’*)iE{i,...,iV} 
uniformly bounded by a constant M (independent of N). 
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Fix e > 0. By our assumption there exists Nq > 0 and h > 0 such that 

\v^’^{t,x') — n| ^ e if iV ^ Nq, ^ 6 and |xi — x'i\ ^ <5. 


As 


N 


lim / 

A'"— >+GO J^d-^N-l 


‘ii Wm{dx'j) = 0, 

i=2 


(192) 


we can also choose Nq large enough so that 

/ 1/j ( JV.I FT midx'A ^ e if iV ^ Aq and \xi - xA ^ 5. 

Jijd-^N-l ,m)s=5) ^ 

Then, integrating (I192h over we obtain 

\w^’^{t,x'i) — n| ^ e + Me = e(M + 1) if A ^ Aq and |a:i — x'i\ ^ 5. 

We now integrate this ineqnality with respect to the measure m on the ball B{xi,5)\ 

f x'l) — n| dm{x'i) ^ e(M + l)m(i?(xi, 5)). 

Jb{xuS) 

Now Theorem I2.131 1iil states that converges in to U{t,-,m). Thns, letting A —> 

+00 in the above inequality, we get 

/ \U{t, x'i,m) — n| dm{x'i) ^ e(M + l)m(B(xi, 5)). 

Jb{xuS) 

Since U is continuons and xi is in the support of m, this last inequality implies that v = 
U{t,xi,m). □ 


6.3 Propagation of chaos 

We now prove Theorem 12.151 Let us recall the notation. Throughout this part, is 

the solution of the Nash system (I175D and the are “optimal trajectories” 

for this system, i.e., solve (|183l) with = Zi as initial condition at time to- Our aim is to 
understand the behavior of the ((Ti,t)te[to,T])i£{i,..., 7 V} a large number of players A. 

For any t 6 {1,... , A}, let {Xi^t)te[to,T] be the solution the SDE of McKean-Vlasov type: 

dXi,t = -DpH (W,t, D,U{t, Xi,t,£iXi,t\W)^ dt + V2dBi + ^dWt, X,^t, = Z,. 


Recall that, for any z e {1,... , A}, the conditional law VF) is equal to (mt) where {ut, mt) 

is the solution of the MFG system with common noise given by (I128D - (I129|) (see section 15.4.31) . 
Solvability of the McKean-Vlasov eqnation may be discussed on the model of (|148l) . 

Our aim is to show that 


E 


sup \Yi^t - Xi^t 
Lts[to,T] 




for some C > 0. Before starting the proof of Theorem 12.151 we need to estimate the distance 
between the empirical measure associated with the iXi^t)iG{i,...,N} and rrit. For this, let ns set 
Xt = iXi^t)ie{i,...,N}- As the (Aj_t) are, conditional on W, i.i.d. random variables with law mt, 
we have by a variant of a result due to Horowitz and Karandikar (see for instance Rashev and 
Riischendorf [62], Theorem 10.2.1): 
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Lemma 6.8. 


E 


sup di 
'-te[to,T] 


m 


N,i 


rrit 


^ C'iV-V('i+8)_ 


Proof. The proof is exactly the same as for Theorem 10.2.7 in [62] (for the i.i.d. case). In this 
proof independence is only used twice and, in both cases, one can simply replace the expectation 
by the conditional expectation. □ 


Proof of Theorem \2.15[ The proof is a direct application of Theorem 16.61 combined with the 
following estimate on the distance between the solution {Xi^t)te[to,T] of (|182l) : 


E 


sup \Xit 

U[to,T] 





(193) 


Indeed, by the triangle inequality, we have, provided that (I193p holds true: 


E 

sup \Yi^t-Xi,t\ 

^e[ sup \Yi^t-Xi^t\ 

+ E 

¥ 

1 


He[to,T] ^ 

'-te[to,T] J 


4£[to,T] 


^ C7(A^“^ + A^“^drf+8))^ 


where we used ()184p to pass from the first to the second line. 

It now remains to check (I193p . For this, we fix i 6 {1,..., A^} and let 

p{t) = e[ sup - Xi^s\ . 


Then, for any s e [foT]) we have 


Xi^s - Xi^s\ ^ [ \-DpH[Xi^r,Dx^u^’\r,Xr)) + DpH[Xi^r, D^U{r, Xi^r,'mr))\dr 
Jto 


^ I \-DpH{Xi^r,D^U{r,Xi^r,m^’i)) + DpH{Xi^r,D^U{r,Xi^r,m^y))\dr 


N,i\ 


'to 


+ f \-DpH[Xi^r,D^U{r,Xi^r,m^'')) + DpH[Xi^r,D:cU{r,Xi^r,mr))\dr. 
Jto 

As {x,m) DxU{t,x,m) is uniformly Lipschitz continuous, we get 

\Xi^s-Xi,s\ ^ ^ (^\Xi,r - Xi,r\ + di(rn^l,rn^'') + di(rn^\mr)^dr, 


where 


j^i 


( 194 ) 


Hence 
























Taking the supremum over s e and then the expectation, we have, recalling that the 

random variables — Xj^r)je{i,...,N} have the same law: 


pit) = IE 


sup \Xi^s - Xi^, 
seho,i] 
t 


I 

J tc 


UK 


to \ '-re[to,s] 

CE 


sup I y T 


N- 


- > E sup \Xj^r — Xj^r 


jjti f’G[io,s] 


ds 




sup di TUr.) 

'-re[to,T] 

C [ pis)ds + CN-^/^^+^\ 
Jto 


where we used Lemma 16.81 for the last inequality. Then Gronwall inequality gives (|193p . □ 
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7 Appendix 


We now provide several basic results on the notion of differentiability on the space of probability 
measures used in the paper, including a short comparison with the derivative on the set of 
random variables. 


7.1 Link with the derivative on the set of random variables 

As a first step, we discuss the connection between the derivative bUjbm in Definition 12.11 and 
the derivative introduced by Lions in [52] and used (among others) in [T6l[22] . 

The notion introduced in [52] consists in lifting up functionals defined on the space of prob¬ 
ability measures into functionals defined on the set of random variables. When the underlying 
probability measures are defined on a (finite dimensional) vector space E (so that the ran¬ 
dom variables that are distributed along these probability measures also take values in E\ this 
permits to benefit from the standard differential calculus on the Hilbert space formed by the 
square-integrable random variables with values in E. 

Here the setting is slightly different as the probability measures that are considered through¬ 
out the article are defined on the torus. Some care is thus needed in the definition of the linear 
structure underpinning the argument. 


7.1.1 First order expansion with respect to torus-valued random variables. 

On the torus we may consider the group of translations {Ty)y^^d, parameterized by elements 
X of For any y 6 M'^, Ty maps T'^ into itself. The mapping 3 y t—> Ty{0) being obviously 
measurable, this permits to define, for any square integrable random variable X e A, P; M'^) 

(where (H, A, P) is an atomless probability space), the random variable Tj^(O), which takes values 
in T'^. Given a mapping U : P(T'^) —> M, we may define its lifted version as 

U : l2(H, a, P; M'^) 9 X ^ tj{X) = [/(£(r^(0))), (195) 

where the argument in the right-hand side denotes the law of t^( 0) (seen as a T'^-valued random 
variable). Quite obviously, C{t^{Q)) only depends on the law of X. 

Assume now that the mapping U is continuously Frechet differentiable on L^(H, .4, P; M'^). 
What [52] says is that, for any X e L^(H, P; M'^), the Frechet derivative has the form 

DU{X) = ^(£(X))(X), P almost surely, (196) 


for a mapping {dyU{C{X)) 3 y ^ dyU{C{X)){y) e 6 L^(]R‘^, £(X)). This relationship 
is fundamental. Another key observation is that, for any random variables X and Y with values 
in and ^ with values in Z'^, it holds that 


lim — 

e^O e 


U{X+ l + eY)-U{X) =¥. {DU{X+ i),Y) 


which is, by the simple fact that = r^(0), also equal to 

1 


lim 
£->•0 e L 


U{X + eY)-U{X) =E (DU{X),Y) 


proving that 

DU{X) = DU{X + i). 


(197) 
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Consider now a random variable X from Vl with values into With X, we may associate 
the random variable X, with values in [0,1)'^, given (pointwise) as the only representative of X 
in [0,1)'^. We observe that the law of X is uniquely determined by the law of X and that for 
any Borel function /i : ^ M, 

E[h(X)] = E[h{X)], 

where h is the identification of h as a function from [0,1)'^ to M. 

Then, we deduce from (I196p that 

DU{X) = d^l7(T(X)) (X), P almost surely. 

Moreover, from (I197p . we also have, for any random variable | with values in Z'^, 

DU{X + I) = d^?7(T(X)) (X), P almost surely. 

Since d^U (T(X))(-) is in C{X)) and X takes values in [0, l)*^, we can identify d^l7(T(X))(-) 

with a function in £(X)). Without any ambiguity, we may denote this function (up to a 

choice of a version) by 

T^3y^d^U{/:{X)){y). 

As an application we have that, for any random variables X and Y with values in T'^, 

U{C{Y)) - U{C{X)) = U{Y) - U{X) 

= E ^ (dU{C{\Y + (1 - A)X)), Y - X^dX. 

Now, we can write 


AT + (1 - A)X = X + A(y - X) = Z, with Z = 

Noticing that Z is a random variable with values in we deduce that 

U{C{Y)) -U{C{X)) = El^\d^U{C{T,^y_j^^{X))){T^^^_j^^{X)),Y - xyx. 
Similarly, for any random variable ^ with values in Z'^, 


U{CiY)) - U{C{X)) = U{Y + i) - U{X) 

= E^ (^DU{X + X{Y + i-X)),Y + i-X^dX. 

Now, X + A(y + ^ — X) writes Z + (^, where C is a random variable with values in Z*^ and Z is 
associated with the T'^-valued random variable Z = T^(y_,_^'_j^)(X), so that 


U{£{Y))-U{C{X))=eI^\dU{Z)., 


Y + ^-X)dX 


= E 


0 


(198) 


The fact that ^ can be chosen in a completely arbitrary way says that the choice of the repre¬ 
sentatives of X and Y in the above formula does not matter. Of course, this is a consequence 


130 






of the periodicity structure underpinning the whole analysis. Precisely, for any representatives 
X and Y (with values in of X and Y, we can write 

U{C{Y)) - U{C{X)) = El^\d^U{C{T,^y_j^){X)){T^^y_j^-^{X)),Y - xyx. (199) 

Formula (|199p gives a rule for expanding, along torus-valued random variables, functionals 
depending on torus-supported probability measures. It is the analogue of the differentiation 
rule defined in [52] on the space of probability measures on through the differential calculus 
in L2(fi,^,P;M'^). 

In particular, if tj is continuously differentiable, with (say) DU being Lipschitz continuous 
on L^(n, P; M'^), then (with the same notations as in (II98IP 

E[\DU{Y) - DUiX)\^] = E[\DUiY + i) - DUiX)\^] 

^ CE[|y+ 

Now, for two random variables X and Y with values in the torus, one may find a random variable 
with values in such that, pointwise, 

i = aigmm^^zd\Tc{Y) - X\, 

the right-hand side being the distance djd{X,Y) between X and Y on the torus. Put it differ¬ 
ently, we may choose ^ such that \Y + ^ — X| = djd{X,Y). Plugged into (12001) . this shows that 
the Lipschitz property of DJJ (on L^(n, P; M*^)) reads as a Lipschitz property with respect to 
torus-valued random variables. 

Next, we make the connection between the mapping P(T'^) x 9 (m, y) ^ dfjU{m){y) e 
and the derivative V{T'^) x 3 (w-,2/) [6U/dm]{m,y) 6 defined in De f i n ition 12.11 


7.1.2 Prom differentiability along random variables to differentiability in m 


Proposition 7.1. Assume that the function U is differentiable in the sense explained in Suh- 
subsection \ 7.1.1 and thus satisfies the expansion formula ()199p . Assume moreover that there 
exists a continuous version of the mapping d^JJ : VifE^) x 3 [fn^y] ^ S^jU{m,y) e M'^. 

Then, U is differentiable in the sense of Definition IKH Moreover, 5U/5m is continuously 
differentiable with respect to the second variable and 


D^Uim,y) = df,Uim){y), m e P(T‘'), y e T^. 

Proof. First step. The first step is to prove that, for any m e P(T'^), there exists a continuously 
differentiable map V{m, •) :T‘^ 3 y ^ V{m, y) eM. such that 

df,U{m){y) = DyV{m,y), yeT‘^. 

The strategy is to prove that dyU{m) : dyU{m){y) is orthogonal (in Lf(fE^,dy)) to 

divergence free vector fields. It suffices to prove that, for any smooth divergence free vector field 

[ (dy,U{m){y),b{y)}dy = 0. 

JTd- 

Since is jointly continuous in {m,y), it is enough to prove the above identity for any m 
with a positive smooth density. When m is not smooth, we may indeed approximate it hy m*p, 
where * denotes the convolution and p a smooth kernel on R'^ with full support. 
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With such an m and such a b, we consider the ODE (set on but driven by periodic 
coefficients) 

dXt = dt, t ^ 0, 

m{Xt) 

the initial condition Xq being [0, l)'^-valued and distributed according to some m e P(T^) 
(identifying m with a probability measure on [0,1)'^). By periodicity of b and m, (W)t>o 
generates on T'^ a flow of probability measures (mj)tj.o satisfying the Fokker Planck equation 

dtmt = —div(—rui), t ^ 0, mo = m. 
m 

Since b is divergence free, we get that mt = m for all t ^ 0. Then, for all t ^ 0, 

U (mt) - U (mo) = 0, 

so that, with the same notation as in (11951) . limt\^o[(C^(-^t) ^ f^(^o))A] = 0. Now, choosing 
Y = Xt and X = Xq in (|199p . we get 


[ (df,U{m){y),b{y)}dy = 0. 
JT'i 

We easily deduce that d^[/(m) reads as a gradient that is 

df,U{m){y) = dyV{m,y). 

It is given as a solution of the Poisson equation 


AV{m,y) = divy dy,U{m){y) 

Of course, V{m, •) is uniquely defined up to an additive constant. We can choose it in such a 
way that 

/ V{m,y)dm{y) = 0. 

Using the representation of the solution of the Poisson equation by means of the Poisson kernel, 
we easily deduce that the function V is jointly continuous. 

Second step. The second step of the proof is to check that Dehnition 12.11 holds true. Let us 
consider two measures of the form and my, where N e N*, X = (xi,... ,xn) e (T'^)^ is 
such that Xi Y xj and Y = {yi,, yj^) e (T'^)'^. Without loss of generality we assume that the 
indices for Y are such that 

1 iv ^ N 

di(m^,m^) = —J^djd(xi,yi) = I®* “^*1’ (^^l) 

i=i i=i 


where xi ,...,xat and yi,... ,yN are well-chosen representatives, in of the points xi ,...,xat 
and yi,..., i/Tv in T'^ {djd denoting the distance on the torus). Let X be a random variable such 
that P(X = Xi) = 1/N and Y be the random variable defined by E = yi Y X = Xj. Then, with 
the same notations as in (jl95p . P£(t^(o)) = P/:(Ty(o)) = my. 

Thanks to ()199p . we get 


U{m^) - U{m^) = £ (r,yy(i_,)x(0)), ^ 


dX 
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So, if It; is a modulus of continuity of the map on the compact set P(T^) x we obtain 
by (imi) : 


/ E (0)), F - X 


dX 


^ E[|y — X\\]w(^di{mx,mY)) = di(m;^,my)i(;(di(m;^,my)). 
Moreover, since DyV{m,y) = d^U{m){y), we have 


E 


SiiU{mx){TxY+{i-\)xi^)) (''■Ay+(i-A)x))^ “ ^ 


dX 


I ^ ri 

= N 2 {mxXxyi+a-^M) ’ yi - 




DyV {rrix, Ay* + (1 - X)xi ), y* - Xi}dX, 


where we saw DyV{m^, •) as a periodic function defined on the whole Then, 
E (^dnU{mx){TxY+{i-x)x{^)){T'xY+(i-\)x)i^ ~ ^ 


( 202 ) 


dX = / V{mx,x)d{mY — rnx){x). 

By density of the measures of the form and my and by continuity of y, we deduce from 
(j202p that, for any measure m,m' e V{T'^), 


U{m') — U{m)— / V{m,x)d{m'— m){x) 
JTd- 


^ di(m,m')tc(di(m,m')), 


which shows that U is in the sense of Definition 12.11 with ^ = V. 


□ 


7.1.3 Prom differentiability in m to differentiability along random variables 

We now discuss the converse to Proposition 17.11 

Proposition 7.2. Assume that U satisfies the assumption of Definition 1 2. Hi Then, U satisfies 
the differentiability property (|199p . Moreover, DmU{m,y) = dyU{m){y), me ViT'^) andye T*^. 

Proof. We are given two random variables X and Y with values in the torus T'^. By Definition 

[m 


U{£{Y)) - U{£{X)) 

/'][’d 6m 
6U 


f L ^ " X)£{X),y)d{£{Y) - £{X)){y) 


dX 


L 


= / E 


6U 


{X£{Y) + (1 - X)£{X),Y) - —{X£{Y) + (1 - X)£{X),X) 
dm dm 


1 /-i 


E 


0 JO 


Dy^{X£{Y) + (1 - A)£(X)) (AW + (1 - A')X)(F - X) 


dX 
dXdX' 


where X and Y are valued random variables that represent the T'^-valued random variables 
X and Y, while Dy[dU/dm]{m, •) is seen as a periodic function from into 
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By uniform continuity of DmU = Dy\5U /6m\ on the compact set P(T'^) x we deduce 
that, 


U[C{Y))-U[C{X)) 
6U 


= E 


Dyj^{^{X)){X){Y-X) 


■ E[|X - yp]V2y;(E[|x - 


(203) 


for a function w : M+ ^ ]R+ that tends to 0 in 0 {w being independent of X and Y). Above, we 
used the fact that di(£(A), C{Y)) ^ E[|A — 

Let now Z\ = 'r_)^cp_x)(^); for A e [0,1], so that ^a+e = '^£(y-x)(-^A)) for 0 ^ A ^ A + e ^ 1. 
Then, (A + e)Y + [1 — (A + e)]X and AT + (1 — A)A are representatives of ^a+e and Z\ and the 
distance between both reads 


|(A + e)Y + [1 - (A + e)]X - AT + (1 - A)A| = e|y - A 
Therefore, by ()203l) . 




o.|f(AZA))(z,){F-x) 


A 6 [0,1]. 


Integrating with respect to A e [0,1], we get (llOOp . 


□ 


7.2 Technical remarks on derivatives 

Here we collect several results related with the notion of derivative defined in Definition EH 
The first one is a quantified version of Proposition 12.31 


Proposition 7.3. Assume that t/ : T'^ x ViT'^) M is , that, for some n e N, U{-,m) and 

su 

- — {-jm, •) are in and in x respectively, and that there exists a constant Cn such 

dm 

that, for any m,m' e 'P(T‘^), 

dU 


and 


U{-,m') — U{-,m) — 


5m 

r 5U ^ 
'jd 5m 




(n+a,2) 


^ Cn, 


(204) 

- m){y) 

^ C„df(m,m'). 

(205) 


n+ct 


Fix m e ^(T^) and let (f e be a vector field. Then 


U (•, {id + f))^m) -U{-,m) - / DmU{-,m, y) ■ (f){y) dm{y) 

JT<i 


^ {Cn + 1 


n+a 


2 

L^(m) 


(206) 


Below, we give conditions that ensure that ()205p holds true. 
Proof. Using (j205l) we obtain 


f SXJ 

U (•, {id + (f){\m) -U{-,m) - J ^—{■,m, y)d{{id + 4>)'^m - m) {y) 
^Cnd\{m,{id +(l))'i),m) ^ C'n|l</>|li 2 (m). 


n+ct 


(207) 
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Using the regularity of |^, we obtain, for an {1, • • • , d}-valued tuple i of length \^\ ^ n and for 
any x e T*^, (omitting the dependence with respect to m for simplicity): 




Ifd. 


' 5 m 


5 U 


{x,y)d{{id + 4 >)f,m]{y) - j_ D^ — {x,y)dm{y)- j_ D^DraU{x,y) ■ (t){y) dm{y) 


I Yd 


Yd 


f 

Jld 

ff 

Jo Jt 


dU 


5 U 


(x, y + (f>{y)) - D^ — {x, y) - D^DmU{x, y) ) • (?i(y) dm{y) 





0 Jo Jvi 


DxDy^{x,y + s(l){y)) - Dl.DmU{x,y)^ ■ (j){y) dm{y)ds 
sDiDyDmU{x,y + st(j){y)) 4 i{y) ■ (j){y) dm{y) dsdt ^ C'n!l</>|li 2 (^), 


where we used (I204h in the last line. 

Coming back to (|207l) . this shows that 


D^U 


(■,{id +- D^U{-,m) - [ DiDmU{-,y) ■ 4 >{y) dm{y) 

^ Jjd 


^ Cn 




which proves (j206p but with a = 0. 

The proof of the Holder estimate goes along the same line: if x, x' e then 
J^^Di^{x,y)d{{id + (i))'^m}{y) - J^^Di^{x,y)dm{y) - J^^DiDmU{x,y) ■ (j){y) dm{y) 

- (^J^^Di^{x',y)d{{id +(j))^m}{y) - J^^Di^{x',y)dm{y) 

- [ DiDmU{x',y) ■ (j){y) dm{y)] 

Jjd J 

= “ DiDmU{x,y)^ ■ 4>{y) dm{y) 

- ~ DiDmU{x',y)^ • (j){y) dm{y) 

^ lo Ld s^y)) - DiDmU{x,y)^ ■ (j){y) dm{y)ds 

~ lo Ld ^ ~ ^i^rnU{x', y)^ ■ (j){y) dm{y)ds 

^ [ [ [ s(DlDyDmU{x,y + st(j){y)) 

Jo Jo Jt‘^ ^ 

- DlDyDmU{x',y + st4){y))^4){y) • (/)(y) dm{y) dsdt 

^ Cn|x-xTll</>ili2(™). 


This shows that 

C SU r ~\ f 

/ {iid +(j))\\m}-m (y) - DmU{-,m,y) ■ (p{y) dm{y) 

JYd dm L J JYd 

Plugging this inequality into (12071) shows the result. 

We now give conditions under which (1205 p holds. 


^ C„ 


n+CK 




□ 
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Proposition 7.4. Assume that [/ : T*^ x ^ R is and that, for some n e N*, 

II SU 


5m 




Lip, 


(n-\-(y,n-\-a) 


I 5m 


^ a. 


Then, for any m,m' e V{T‘^), we have 


5U 


U{-,m') -U{-,m) - I —{■,m,y)d{m'-m){y) 


Ijd 


^ C'nd^(m, m'). 


n+Q 


Proof. We only show the Holder regularity: the L® estimates go along the same line and are 
simpler. For any £ e with j^l ^ n and any x,x's T'^, we have 

r AT T 

Dl,U{x,m') — DlXJ{x,m) — / dI,- — {x,m,y)d{m'— m){y) 


Ijd 


5m 


< 


- (^Diu (x', m') - Diu{x, m) - Di^{x', m, y)d{m' - m)(y)^ 

' f , , , , , o5U, 

rd \ ^xj^\^^A-s)m + .sm,y) - D^ — {x,m,y) 


SU^ su^ 

- s)m + sm',y) - Di — {x',m,y) \ ) d{m' - m){y) 


ds 


^ sup 
s,y 


su su 

-DyL>i^(x,(l - s)m + sm',y) - DyDl — {x,m,y) 

SU su 

(1 - s)m + sm', y) - DyDi — {x', m, y) 


5m 


di(m, m') 


^ Lip„ k “ x'|"df(m, m'). 


This proves our claim. 

Proposition 7.5. Assume that U : ViT'^) R is with, for any m, m' e V(T^), 

5U 


□ 


U{m') — U{m)— / — {m,y)d{m' — m){y) 




5m 


1 


5‘^U 


2 Jfd Jfd 5m? 
where w{t) ^0 as t —>■ 0, and that 

II 5U 


(m, y, y')d{m' — m){y)d{m' — m){y' 

5^U 


< 


df (m, m')w[di{m, m ')), 


( 208 ) 


5m 


(m, •) 


5m^ 




^Co. 


( 2 , 2 ) 


Then, for any m e P(T'^) and any vector field (f e L^(m,R'^), we have 
U {{id + (/>)tjm) -U{m) - / DmU (m, y) ■ (f){y) dm{y) 

JTd 

DyDmU{m,y)(j){y) ■ (j){y) dm{y) 




1 
2 

y i 

^ Jfd Jfd 


Dl^^U{m,y,y')(l){y) ■ (j){y') dm{y)dm{y'] 


^ ll</>ili^^(ll<(>llLi), 

where the modulus w depends on w and on Cq. 
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Proof. We argue as in Proposition I7..SI by our assumption, we have 


U {{id + — U{m) — 

6^U 


Ijd 


6 m 


(m, y)d(^{{id + (j))\\m} - {y) 


< 


fjd fjd {y)d(^{{'id + 4>)f>m} - "i) {z) 

dl{m,{id +(f){\m)w{di{m,{id +(l)){\m)) ^ \\4>\\l3 w{\\(I)\\l^). 


Now 


/ 

JT 


'fd 6 m 


m,y)d [{id + 4>)‘^m} — m {y) 

f (^{'m,y + (p{y))-^{m,y)\dm{y) 
Jjd \dm dm J 


fjd 


■ <P{y) + ^d:'l^{'m,y)4>{y) ■ (j){y) + 0{\(p{y)\^) ) dm{y) 


where 


Moreover, 


fjd ■ ^^y) + ^DyDmU{m,y)(j){y) ■ (j){y) + O(|0(y)P)^ dm{y), 

[ |0(l<?^(2/)l^)| dm{y) ^ \\DyDmU\\ [ \(j){y)\^dm{y) ^ C'o||</>||' 

Jjd' ^ ®JTd 


Ilia • 


f f ^ 

Ijd Jjd 6m^ 


I Yd Jy^ \6m‘^ 


{m,y,z)d [{id + (j))‘^m[ — m {y)d [{id + (j))'^m[ — m 

-(rm ,, -1- ,h(i,\ ■y -1- rk(-yW - -^ ^ ^ 

dm^ om^ 

6^U 
6m^ ^ 


I —^ (m, y + (l){y),z + (p{z) ) - (m, y + (j){y), z) - -^(m, y,z + (j){z)) 


m,y,z) I dm{y)dm{z) 




D: 


6^U 




{m,y,z)(f{y) ■ (j){z) + 0{\(j){y)\‘^\(p{z)\ + \(f{y)\\f){z)\‘^) ) dm{y)dm{z) 


lYd JYd 


DmmU{m,y,z)(j){y) ■ (j){z) + O(|(/)(y)|^|0(z)| + \(l){y)\\(j){z)\'^) ]dm{y)dm{z) 


where 


I Yd 


o 


{z)\ + \(i){y)\\(t){z)\^] dm{y)dm{z) 


^ sup||D^^?7(m,-,-)l|(ci)2||0|li3 ^ C'oll'/’llis 


Putting the above estimates together gives the result. □ 

We complete the section by giving conditions under which inequality (j208h holds: 

Proposition 7.6. Assume that the mapping P(T'^) 3 m^ '>') continuous from V{T‘^) 

into (C^(T‘^))^ with a modulus w. Then (j2n8h holds. 
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Proof. We have 


5U 


U{m') — U{m)= / / -—{ifi — . 3)171 + srri',y)d{m'— rn){y) 


lo Jjd, 5m 

/ —{m,y)d{m'-m){y) 

Ifd dm 

m pU 

s -—^((1 — sr)m + srm', y, y')d{m' — m)iy)dim' — m)iy'). 
•d dm^ 


Hence 


f su 

U{m') — U{m)— / -— {m,y)d{m' — m){y) 


1 r r 5^u 

2 Jjd Jjd 5m^ 


ffd 5m 

(m, y, y')d{m' — m){y)d{m' — m){y'] 


1 ri 


/\2 


^ di(m, m') 



'0 JO 

^ di{m,m')‘^w[di{m,m')‘^) 


Dyy'TZ;2(^^ •’') 


drds 


□ 
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